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INTRODUCTION 


The  Large  Space  Structure  (LSS)  research  program  was  originally  formulated  in- 
"~tete~I982  in  response  to  titeincreasing  concern  that  performance  robustness  of  Air 
Force  LSS  (yp&^ systems  would  be  inadequate  to  meet  mission  objectives.  In  partieusr- 
uncertainties  in  both  system  dynamics  and  disturbance  spectra  characterizations 
(both  time  varying  and  stochastic  uncertainty)  significantly  limit  the  performance 
attainable  with  fixed  gain,  fixed  architecture  controls.  Therefore,  the  use  of  an  adap¬ 
tive  system,  where  disturbances  and/or  plant  models  are  identified  prior  to  or  during 
control,  gives  systems  designers  more  options  for  minimizing  the  risk  in  achieving  per¬ 
formance  objectives.  ^ 


& 


.  f 

c  The  aim  of  adaptive  control  is  to  implement  in  realftime  and  onfline  as  many  as 
possible  of  the  design/functions  now  performed  offline  by  the  control  engineerPio 
give  the  controller  ‘‘intelligence.^  To  realize  this  aim,  both  a  theory  of  stability  and 
performance  of  such  inherently  nonlinear  controls  is  essential  as  well  as  a  technology 
capable  of  achieving  the  implementation.  T  'r  'E,>  '  T  ^  5  ^  r  r  A  . '(/■•  j.-  /I  \ 

The  issues  of  performance  sensitivity,  robustness,  and  achievement  of  very  high 
performance  in  an  LSS  system  can  be  effectively  addressed  using  adaptive  algorithms. 
The  need  to  identify  modal  frequencies,  for  example,  in  high-performance  disturbance 
rejection  systems  has  been  shown  in  AC OSS  (1981)  and  VCOSS  (1982).  The  deploy¬ 
ment  of  high-performance  optical  or  RF  systems  may  require  on-line  identification  of 
critical  modal  parameters  before  full  control  authority  can  be  exercised.  Parameter 
sensitivity,  manifested  by  performance  degradation  or  loss  of  stability  (poor  robust¬ 
ness)  may  be  effectively  reduced  by  adaptive  feedback  mechanizations.  Reducing  the 
effects  of  on-board  disturbance  rejection)  is  particularly  important  for  planned  Air 
Force  missions.  For  these  cases,  adaptive  control  mechanizations  are  needed  to  pro¬ 
duce  the  three-to-five  orders-of-magnitude  reductions  in  line-of-sight  jitter  required  by 
the  mission. 

Research  is  essential  to  identify  the  performance  limitations  of  adaptive  strategies 
for  LSS  control  both  from  theoretical  and  hardware  mechanization  viewpoints.  The 
long  range  goal  of  this  proposed  research  program  is  to  establish  guidelines  for  select¬ 
ing  the  appropriate  strategy,  to  evaluate  performance  improvements  over  fixed-gain 
mechanizations,  and  to  examine  the  architecture  necessary  to  produce  a  practical 
hardware  realization.  The  initial  thrust,  however,  is  to  continue  to  build  a  strong 
theoretical  foundation  without  losing  sight  of  the  practical  implementation  issues. 


RESEARCH  OBJECTIVES 


The  aims  of  this  research  study  are  to  extend  and  develop  adaptive  control  theory 
and  its  application  to  LSS  in  several  directions.  These  include: 

(1)  Theoretical  Development:  The  initial  emphasis  has  been  on  slow  adapta¬ 
tion,  since  this  covers  many  LSS  situations.  Later  on  we  will  examine  fast 
adaptation.  The  theory  developed  here  will  provide  for: 

(a)  estimates  of  robustness,  i.e.,  stability  margins  vs.  performance  bounds; 

(b)  estimates  of  regions  of  attraction  and  rates  of  parameter  convergence  to 
these  regions; 

(c)  extension  of  the  present  linear  finite  dimensional  adaptive  theory  to 
include  nonlinear  and  infinite  dimensional  plants  and  controller  struc¬ 
tures;  and 

(d)  extensions  to  decentralized  systems. 

(2)  Parameter  Adaptive  Algorithms:  Assesses  the  behavior  of  different  algo¬ 
rithms,  including:  gradient,  recursive  least  squares,  normalized  least  mean 
squares,  and  nonlinear  observer  (e.g..  Extended  Kalman  Filter). 

(3)  Parametric  Models:  Assess  the  impact  of  model  choices.  In  particular  we 
will  examine  the  effect  of  explicit  and  implicit  model  choices.  An  explicit 
model,  for  example,  is  a  transfer  function  whose  coefficients  are  all  unk¬ 
nown.  In  an  implicit  model  model  transfer  function,  the  coefficients  would 
be  functions  of  some  other  parameters.  Implicit  models  usually  arise  from 
physical  or  experimental  data,  whereas  explicit  models  are  selected  for 
analytical  convenience. 

(4)  Adaptive  Nonlinear  Control:  Although  our  early  effort  is  to  study  adaptive 
linear  control,  there  are  many  LSS  situations  where  the  control  is  nonlinear, 
e.g.,  large  angle  maneuvers,  slewing. 


CURRENT  STATUS 


At  the  present  time  we  stand  at  the  beginning  stages  of  the  theoretical  develop¬ 
ment  in  adaptive  control.  The  result  of  recent  efforts  are  contained  in  the  selected 
papers  in  the  Appendix  and  the  references  therein.  A  summary  of  earlier  efforts  is 
contained  in  the  recently  published  textbook  Stability  of  Adaptive  Systems:  Passivity 
and  Averaging  Analysis,  MIT  Press,  1986.  This  publication  is  an  outgrowth  of 
research  supported  under  this  contract  and  involved  a  considerable  amount  of  colla¬ 
borative  effort  among  several  researchers  in  the  field  of  adaptive  control.  The  text 
discusses  adaptive  systems  from  the  viewpoint  of  stability  theory.  The  emphasis  is  on 
methodology  and  basic  concepts,  rather  than  on  details  of  adaptive  algorithm.  The 
analysis  reveals  common  properties  including  causes  and  mechanisms  for  instability 
and  the  means  to  counteract  them.  Conditions  for  stability  are  presented  under  slow 
adaptation,  where  the  method  of  averaging  is  utilized.  In  this  latter  case  the  stability 
result  is  local,  i.e.,  the  initial  parametrization  and  input  spectrum  is  constrained.  Based 
on  this  analysis,  a  conceptual  framework  is  now  available  to  pursue  the  issues  of  slow 
adaptive  control  of  LSS. 

To  remove  the  restrictiveness  of  slow  adaptation  requires  an  understanding  of  the 
transient  behavior  of  adaptive  systems.  A  preliminary  investigation  is  reported  in 
Kosut  et  al.  (1986)  which  is  reprinted  in  the  Appendix.  The  transient  behavior  of  not- 
slow  or  even  rapid  adaptation  is  a  significant  problem  in  the  adaptive  control  of  LSS, 
e.g.,  rapid  retargeting. 

Another  approach  to  adaptive  control  is  to  calibrate  (or  tune)  the  controller  based 
on  a  current  estimate  of  the  LSS  model.  This  involves  not  just  knowing  one  model, 
but  rather,  a  model  set.  This  problem,  which  we  refer  to  as  adaptive  calibration,  is 
essentially  that  of  developing  a  technique  of  on-line  robust  control  design  from  an 
identified  model.  Although  we  have  worked  on  this  problem  for  some  time  it  is  only 
recently  that  we  have  established  a  theoretical  basis  for  estimating  model  error  from 
system  identification  [see  Kosut  (1986),  a  reprint  is  in  the  Appendix].  This  research 
has  raised  many  new  questions  which  need  to  be  considered,  e.g.,  what  is  the  appropri¬ 
ate  robust  controller  parametrization;  how  does  it  relate  to  model  parametrization;  how 
to  iterate  on  the  data  if  the  estimate  of  model  error  is  too  large;  what  are  the  heuristics 
for  experiment  design. 
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COLLABORATIVE  RESEARCH  EFFORT 

It  should  be  emphasized,  and  acknowledged,  that  a  great  deal  of  collaborative 
effort  has  been,  and  is  being,  expended  by  several  researchers  in  the  field  of  adaptive 
control.  The  text  referred  to  before  is  in  part  due  to  the  two  visits  by  Dr.  Kosut  to  the 
Department  of  System  Engineering  at  the  Australian  National  University.  Support  for 
these  visits  has  come  from  this  contract,  a  travel  grant  from  the  NSF  International  Pro¬ 
gram  (INT-85-13400),  and  a  Visiting  Fellow  Award  from  the  Australian  National 
University. 

FUTURE  DIRECTIONS 

Based  on  our  recent  results  as  reported  here,  we  envision  near-term  activity  in 
several  directions,  including: 

(a)  transient  analysis  of  adaptive  control; 

(b)  analysis  of  adaptive  calibration; 

(c)  decentralized  control  structures; 

(d)  effect  of  nonlinear  and  infinite  dimensional  phenomena; 

(e)  effect  of  different  algorithms  and  parametrizations. 
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Issues  in  Control  Design  for  Large  Space  Structures 


Robert  L.  Kosut  and  Michael  G.  Lyons 
Integrated  Systems  Inc 
101,  University  Avenue 
Palo  Alto,  Ca.  04301 


Abstract 

The  development  of  a  design  methodology  for  the  control  of  Large  Space  Structures 
(LSS)  involves  many  different  issues.  In  this  paper  we  present  a  selective  discussion  of  the 
theoretical  and  practical  issues  that  seem  most  relevant.  The  discussions  cover  various  types 
of  control  design  procedures,  including  both  robust  (non-adaptive)  as  well  as  adaptive,  with 
an  emphasis  on  their  practical  use. 


1.  LSS  Control  Problem  Setting 

1.1  Control  Design  Objectives 

Problems  associated  with  vibration  control  and  accurate  pointing  of  LSS  systems  typically 
involve  a  combination  of  the  following  control-performance  objectives. 

1.  modal  damping  augmentation  to  enhance  transient  settling  or  improve  quasi-static 
vibration  propagation  behavior, 

2.  stabilization  of  the  attitude  control  system, 

3.  eigenvector  modification  to  reject  narrow  band  steady-state  disturbances,  and 

4.  maneuver  load  management  to  minimize  structural  loads  or  modal  excitation  (transient 
or  steady-state). 

1.2  Modeling 

The  basis  for  selecting  a  control  strategy  must  include  an  adequate  description  of  the  rel¬ 
evant  structural  dynamics  together  with  a  description  of  how  system  performance  is  to  be 
measured.  Initially,  continuum  models  were  suggested  as  the  basis  for  proper  system  design 
since  discretization  of  the  model  could  be  postponed  or  eliminated.  Unfortunately,  practical 
spacecraft  configurations  do  not  present  simple  boundary  conditions  or  simple  shapes,  hence 
partial  differential  equation  (p.d.e.)  representations  are  nearly  impossible  to  write.  However 
such  continuum  models  have  provided  useful  insight  into  appropriate  discrete  representa¬ 
tions.  Finite  element  models  can  provide  adequate  fidelity,  at  least  over  the  frequency  range 
needed  for  the  control  design  model,  and  are  supported  with  sophisticated  software  tools 
easily  adapted  to  the  needs  of  control  design  [1]. 

1.3  Two-Level  Control  Architecture 

The  natural  structural  properties  of  LSS  systems  compel  the  use  of  a  two-level  control  system 
architecture  as  shown  in  Figure  1.  The  two  levels  are  a  colocated  rate-damping  control  system 
and  a  noncolated  high  performance  control  system.  The  colocated  system  consists  typically 
of  rate  damping  devices,  either  active  or  passive,  and  requires  a  coarse  knowledge  of  system 
dynamics.  These  are  inherently  robust  but  yield  low  performance.  They  essentially  provide 
a  wido-band,  Low-Authority  Control  (LAC)  and  are  often  referred  to  cs  the  LAC-system. 


Figure  1:  Two  Level  Control  Architecture  for  LSS  Systems. 


The  high  performance  control  is  non-colocated  and  requires  accurate  knowledge  of  critical 
modes,  and  hence,  is  very  sensitive  to  disturbance  and  structural  parameter  variations.  This 
controller  system  is  essentially  a  narrow-band,  High- Authority  Control  (HAC),  and  is  referred 
to  as  the  HAC-system.  Typically,  HAC  provides  high  damping  and  mode  shape  adjustment 
in  selected  modes  to  meet  performance  requirements. 

LAC  synthesis  principally  involves  passivity  methods  and  rate  feedback  mechanizations, 
usually  with  colocated  actuators  and  sensors  [2]. 

HAC  synthesis,  in  addressing  performance  goals  associated  with  dynamic  wavefront  and 
line-of-sight  error  suppression,  requires  high  modal  damping  and  mode  shape  changes.  Hence, 
HAC  is  dependent  on  accurate  narrow-band  models.  For  such  requirement,  it  is  essential 
that  control  design  techniques  manage  both  dependence  on  model  fidelity  and  system  gain  in 
regions  where  model  fidelity  is  poor.  This  has  generally  been  accomplished  using  fixed-gain 
robust  control  theory,  [4].  With  this  architecture  it  is  likely  that  only  the  HAC  would  be 
tuned  by  an  adaptive  system  since  the  LAC  is  inherently  robust. 

1.4  Adaptive  Techniques 

In  general,  uncertaintites  in  both  disturbance  spectra  and  system  dynamical  characteristics 
limit  the  performance  obtainable  with  fixed  gain,  fixed  order  control,  e.g.  HAC  system- 
The  use  of  an  adaptive  control  mechanization  where  disturbance  and/or  plant  dynamics  are 
identified  prior  to  or  during  control,  gives  system  designers  more  options  for  minimizing  the 
risk  in  achieving  performance  benchmarks. 

In  the  case  of  LSS  systems,  the  performance  levels  are  extremely  high.  Hence  it  is  nec¬ 
essary  that  disturbance  and  plant  models  are  accurately  known.  Since  model  data  obtained 
from  ground  testing  is  unlikely  to  sufficiently  match  the  actual  on-orbit  system,  it  follows 
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Figure  2:  Closed-loop  System  Performance  vs.  Structural  Parameter  Variations. 


that  on-line  procedures  are  needed  for  identification  and  control. 

The  generic  properties  of  closed-loop  system  performance  vs.  structural  parameter  vari¬ 
ations  are  depicted  in  Figure  2. 

2.  Control  Design  for  HAC/LAC  Architecture 

In  this  section  we  wil  discuss  the  steps  involved  in  control  design  for  the  HAC/LAC  ar¬ 
chitecture.  Although  the  architecture  is  specialized,  the  control  design  methodology  is  not 
and  can  be  quite  general.  We  will  discuss  three  methodologies  for  design:  (1)  an  LQG 
based  methodology  whose  genesis  is  the  ACOSS/VCOSS  programs,  and  (2)  a  more  recent 
approach  involving  what  is  known  as  “Q-parametrization”  and  Hoo-optimization" .  These 
latter  methods  are  frequency  domain  oriented  rather  than  state-space  oriented  like  the  LQG 
approach.  (3)  We  will  also  discuss  an  adaptive  control  strategy  which  can  be  utilized  for 
online  self-tuning.  We  refer  to  this  approach  as  “adaptive  calibration” . 

2.1  Limitations  of  Design 

Independent  of  the  design  method,  the  defining  characteristic  of  the  vibration  control  prob¬ 
lem  is  that  there  are  an  infinite  number  (theoretically)  of  elastic  modes,  with  low  natural 
damping,  and  the  controller  bandwidth  extends  over  a  significant  number  of  these  modes 
(Figure  3).  The  low  frequency  modes  interact  not  only  with  the  attitude  controller  but 
contribute  directly  to  the  deformation  geometry  of  the  structure  which  itself  may  require 
accurate  control.  Proper  control  synthesis  requires  that  performance  criteria  be  precisely 
formulated  or  the  control  problem  i»  ill-posed. 

The  control  design  approach  must  properly  handle  the  poorly  known  higher  frequency 
modes  by  not  destabilizing  them  while  controlling  the  low  frequency  modes.  Indeed,  no 
matter  where  the  controller  roll-off  frequency  is  situated,  the  infinite  nature  of  the  modal 
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Figure  3:  Flexible  Structure  Mode  Location  and  Controller  Bandwidth. 


spectrum  implies  that  there  will  be  modes  within  and  beyond  the  roll-off  region.  Further* 
more,  destabilization  is  likely  and  almost  certain  to  occur  in  the  roll-off  region,  a  situation 
which  can  only  worsen  for  closely  packed  modes  and  low  natural  damping.  This  phenomenon 
sometimes  referred  to  as  “spillover”  is  one  of  the  most  crucial  problems  faced  by  the  control 
designer.  In  more  general  terms,  spillover  can  be  viewed  as  an  aspect  of  the  problem  of 
robust  control  design;  this  will  be  discussed  more  in  a  later  section. 


2.2  Modeling  of  Flexible  Spacecraft 


A  central  issue  in  the  active  control  of  space  structures  is  the  development  of  “correct* 
mathematical  models  for  the  open  and  closed  loop  dynamical  plants.  Programs  such  as 
NASTRAN  and  SPAR  are  the  primary  current  tools  for  generating  dynamical  models  of 
conceptual  spacecraft  whose  structure  cannot  be  idealized  by  simple  models  of  beams,  plates, 
and  beams  with  lumped  masses. 

Finite  element  structural  programs  generally  provide  the  control  designers  with  a  set 
of  modal  frequencies  and  a  set  of  mode  shapes  (eigenvectors)  corresponding  to  appropriate 
boundary  values  (e.g.  free-free  modes).  These  eigenvectors  are  given  in  discretized  form, 
i.e.  a  set  of  modal  displacements  in  the  x,  y  and  z  directions  at  each  nodal  station.  In 
some  cases,  modal  rotations  are  also  required.  In  addition,  coordinates  and  a  “map”  of  the 
structure’s  nodes  must  be  provided  to  allow  the  reconstruction  of  physical  displacements  in 
terms  of  their  modal  expansions. 

The  important  point  here  is  that,  for  any  nontrivial  flexible  satellite  configuration,  the 
volume  of  information  is  so  large  that  the  data  handling  must  remain  entirely  within  the 
computer  and  its  mass-storage  facilities.  Development  of  this  database,  in  a  form  usable 
by  control  synthesis  software,  is  a  fundamental  necessity  for  the  synthesis  and  evaluation  of 
complex  control  which  require  modal  truncation,  actuator/sensor  location  and  type  changes, 
and  evaluation  of  system  performance  for  parameter  and  system  order  changes.  Preparation 
of  a  structure  for  controls  is  a  major  part  of  the  overall  effort  required  to  develop  structural 
control  systems. 


2.3  Nonlinear  Models 


For  single-body  monolithic  structures,  the  fine-pointing  attitude  dynamics  are  subsumed  in 
the  rotational  rigid  body  modes  included  in  the  modal  matrix.  When  only  “small”  motions 
of  a  space  structure  are  being  considered,  the  conventional  linear  structural  dynamics  anal¬ 
yses  (NASTRAN  and  SPAR)  are  adequate,  and  the  rigid-body  modes  are  formally  handled 
together  with  the  elastic  modes,  even  though  the  actuators  necessary  to  control  them  will 
be  different,  in  general,  from  those  used  to  control  elastic  vibrations.  When  larger  attitude 
angles  need  to  be  considered,  if  the  angular  rates  remain  small,  the  linear  equations  are 
still  applicable  provided  that  the  rigid-body  modes  are  now  given  in  terms  of  three  attitude 
angles  which  then  constitute  the  first  three  modal  coordinates.  The  displacements  are  then 
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Figure  4:  Analytical  Control-Design  Procedure . 

interpreted  as  the  linear  deformations  of  the  structure  with  respect  to  the  rotated  frame. 
This  procedure  removes  the  kinematic  nonlinearities  resulting  from  the  linear  stretching  of 
the  structure  under  the  classical  rigid-body  modes.  However,  for  large  angular  rates,  non¬ 
linear  dynamic  effects  have  to  be  modeled,  even  though  structural  deformations  can  still  be 
represented  by  linear  equations. 

2.4  Two-Level  Control  Design:  The  HAC/LAC  Methodology 

The  two-level  approach  consists  of  a  wide-band,  low-authority  control  (LAC)  and  a  narrow- 
band,  high-authority  control  (HAC).  HAC  provides  high  damping  or  mode-shape  adjustment 
in  a  selected  number  of  modes  to  meet  performance  requirements.  LAC,  on  the  other  hand, 
introduces  low  damping  in  a  wide  range  of  modes  for  maximum  robustness.  Figure  4  shows 
the  control  design  procedure  with  integrated  LAC  and  HAC  designs. 

LAC  is  usually  implemented  with  colocated  sensors  and  actuators.  However,  the  theory, 
based  on  the  work  of  Aubrun,  is  applicable  to  multiple  actuators/sensors  with  cross-feedback 
and  possible  filters  (2j. 

HAC  uses  a  collection  of  sensors  and  actuators  not  necessarily  colocated.  Selecting  the 
increase  in  damping  ratio  is  realized  by  any  number  of  methods  including  LQG  with  fre¬ 
quency  shaping,  <?-parametrization,  or  #<» -optimization.  These  methods  provide  roll-off 
over  desired  frequency  regions.  HAC  may  destabilize  modes  not  used  in  the  design.  LAC  is, 
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Figure  5:  Need  to  Integrate  HAC  and  LAC. 


therefore,  necessary  to  “clean  up”  problems  created  by  HAC. 

The  need  to  integrate  HAC  with  LAC  is  shown  in  Figure  5.  HAC  is  based  on  models  valid 
over  a  limited  frequency  region.  It  produces  large  increases  in  damping  ratio  and  disturbance 
rejection  in  the  frequency  range  of  interest.  The  effect  of  the  HAC  controller  on  modes 
not  used  in  the  control  design  and  outside  the  controller  bandwidth  may  be  stabilizing  or 
destabilizing.  LAC  is  designed  to  provide  protection  such  that  adequate  damping  is  provided 
in  the  mode  most  adversely  perturbed  by  HAC.  With  reference  to  Figure  5,  the  LAC  moves 
the  entire  uncertainty  region  above  the  zero  level  damping  ratio. 

In  the  next  few  sections,  a  more  in-depth  discussion  of  the  blocks  in  Figure  4  will  be 
presented,  in  particular,  actuator /sensor  location,  model  and  controller  reduction  methods, 
and  HAC/LAC  synthesis.  These  methodologies  rely  on  certain  properties  of  feedback  con¬ 
trol:  this  raises  the  issue  of  robust  control  design  which  is  fundamental  to  the  whole  design 
philosophy  of  feedback,  especially  for  LSS,  and  this  will  be  discussed  first. 

2.5  Robust  Control  Design 

This  section  will  describe  how  to  evaluate  the  robustness  of  a  control  design.  The  evaluation 
is  independent  of  the  methodology  used  to  achieve  a  particular  design.  To  illustrate  the  tech¬ 
nique  we  will  consider  the  robust  control  problem  of  vibration  suppression  with  unmodeled 
high  frequency  dynamics.  Figure  6  shows  the  control  system  where  P(s)  is  the  plant  transfer 
function  matrix  from  actuator  inputs  to  LOS  sensor  measurements,  and  where  C(s)  is  the 
controller  transfer  function  matrix.  Neglecting  the  rigid  body  modes  in  P(s)  and  assuming 
infinite  bandwidth  sensors  and  actuators, 
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Suppose  that  n  of  the  modes  are  known.  Let  Pn(s)  denote  the  known  part  of  P(s). 


Figure  6:  Vibration  Suppression  Control  System . 


For  example,  P„(s)  can  be  obtained  from  P(s)  by  modal  truncation,  i.e.,  the  first  n- 
modes  of  P(s)  are  retained.  One  can  ask  the  question:  is  this  the  best  choice  for  a  given 
model  order  n?  In  general,  it  depends  on  what  is  meant  by  “best”.  For  closed  loop  control, 
it  is  usually  better  to  retain  those  n-modes  which  most  affect  the  closed-loop  performance. 
}low  to  select  these  modes  will  be  discussed  in  the  section  on  mode)  reduction. 

Assuming  the  modes  have  been  selected,  define  model  error  as 

*(•)  =  /»(«)  ~  **«(»)  =  £  Gt(s). 

ten* 

Observe  that  6(s)  is  stable  because  both  P(s)  and  Pn(s)  are  stable.  Hence,  it  can  be  shown 
that  the  closed  loop  system  is  stable  if 

where  <?„(«)  is  given  by  Q„[s)  =  C(s)[/  +  P„(s)C(s)]-1  and  tf(-)  denotes  the  maximum 
singular  value  of  the  matrix  argument.  The  quantity  6,m(u>)  is  referred  to  as  the  “stability 
margin”,  hence,  the  subsecripts  “sm”.  (See  (3,4].) 

The  stability  robustness  test  depends  on  the  location  of  uncertainty.  Additive  perturba¬ 
tions  such  as  those  just  discussed  result  in  the  test  as  shown.  The  table  in  Figure  7  shows 
a  variety  of  stability  margins  corresponding  to  generic  forms  of  model  error.  In  Figure  7, 
P  =  plant,  C  =  control,  M  nominal  model,  and  6  =  model  error.  The  stability  margin  is 
expressed  as  a  function  of  C  and  M  which  are  known  quantities.  Examples  of  some  model 
error  testts  are  shown  in  Figure  8  for  the  CSDL  #2  VCOSS  model. 

2.6  Performance  Robustness 

The  stability  robustness  tests  can  be  extended  to  evaluate  performance  robustness  to  model 
error.  The  evaluation  is  determined  by  how  performance  is  measured.  Consider  the  closed 
loop  system 

V(  0  =  H(s)d(t) 

where  H(e)  is  the  closed  loop  transfer  function.  Although  d(t)  is  not  precisely  known,  it 
can  be  considered  as  the  output  of  a  weighting  filter  W  (e)  driven  by  “noise”  u >(t)  so  that 
<*(0  =  W(a)«;(t). 

Typical  performance  bounds  depend  directly  on  the  frequency  dependent  quantity 
\H{ju)W (jw)J.  A  natural  frequency  domain  performance  criterion  is  then 

a\H (j'w)W (j'w)]  <  p(u) 

where  p(<*>)  is  selected  on  the  basis  of  power,  energy,  and  magnitude  specifications  on  the 
output  signals.  In  terms  of  model  error,  performance  specification  is  satisfied  if 

<  5pm(w) 

where  £pm(w)  js  the  performance  margin  given  by 
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Figure  8:  Results  of  Model  Error  Tests. 


and  p„(w)  is  the  performance  of  the  nominal  closed  loop  system  Hn(a)  with  no  model  error. 
Then, 

p„(w)  =  o[Hn(ju)W  (ju>)\ 

which  must  always  be  smaller  than  p(<v)  in  order  for  6pm(uj)  to  be  meaningful.  Note  that 
&pm{u)  >  as  would  be  expected  since  performance  includes  stability.  As  before,  the 

location  of  uncertainty  modifies  the  calculation  of  £pm(w). 

2.7  Usefulness  of  Stability /Performance  Robustness  Tests 

The  stability /performance  robustness  tests  are  indispensible  in  obtaining  a  realistic  pre¬ 
liminary  design.  They  are  used  in  a  number  of  places  in  the  design  cycle  to  establish  the 
HAC/LAC  gains,  effect  of  actuator/sensor  dynamics,  and  the  criteria  for  model  and  con¬ 
troller  reduction,  which  will  be  discussed  in  the  next  section.  The  tests  are  also  invaluable 
in  establishing  criteria  for  online  system  identification  and  control,  which  will  be  discussed 
later  on  in  this  section. 

2.8  Model  Reduction 

In  general,  the  requirements  for  model  reduction  for  active  control  of  large  space  structures 
must  include  the  following: 

1.  The  reduced  model  should  be  suitable  for  control  design  and  synthesis.  It  should 
incorporate  all  features  critical  for  the  selection  of  a  feedback  structure  and  control 
gains. 

2.  The  reduced  model  should  accurately  incorporate  actuator  effectiveness,  sensor  mea¬ 
surements  and  disturbance  distribution  [l]. 

3.  The  dynamical  characteristics  of  interest  in  the  structure  should  be  represented  in  the 
reduced  model. 

A  basic  methodology  for  model  reduction  which  has  been  used  successfully  in  ACOSS/VCOSS 
and  a  number  of  other  programs  such  as  internal  balancing,  is  now  described.  Other  ap¬ 
proaches  also  exist  which  will  be  discussed  in  the  sequel. 

2.9  Internal  Balancing 

To  determine  the  most  important  modes  for  control  design,  many  criteria  must  be  considered 
including  controllability,  disturbability,  observability  in  performance,  and  observability  in 
the  measurements.  Any  mode  which  is  highly  controllable,  observable,  and  disturbable 
must  clearly  be  included  in  the  design  model:  however  highly  controllable-but-unobservable 
modes,  for  example,  are  difficult  to  judge.  Moore  [5]  has  developed  an  “internal  balancing” 
approach  whereby  asymptotically  stable  linear  models  are  transformed  to  an  essentially 
unique  coordinate  representation  for  which  controllability  and  observability  rankings  are 
identical.  The  definition  of  internally  balanced  coordinates  follows: 

Definition:  An  asymptotically  stable  model 

x  =  Ax  +  Bu 
y  =  Cx 

is  internally  balanced  over  [0,  oo]  iff 

J~eA,BBTeAT,dt  =  j”  tAT'CTCeAldt  =  £  * 

where 

YL  1  =  diag\o\  o\  . .  o*]  i  >  j,  of  >  of. 
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Figure  9:  Open  Loop  Modal  Analysis . 


Notice  that  the  balanced  representation  is  such  that  the  controllability  Gramian  and 
observability  Gramian  are  equal  and  diagonal.  The  <r<’ s  are  termed  “second-order  modes.” 
In  general,  the  required  transformation  “scrambles”  the  original  coordinate  system  such  that 
the  physical  meaning  of  the  states  is  lost. 

However,  for  lightly  damped  structural  models  with  decoupled  dynamics,  the  internally 
balanced  coordinate  representation  is  approximately  equal  to  a  scaled  representation  of  the 
model  states.  Thus  it  is  possible  to  write  approximate  formulae  for  the  states  in  terms  of 
the  original  model.  Three  modal  rankings  are  considered: 

•  disturbance  inputs  to  LOS 

•  actuator  inputs  to  LOS 

•  actuator  inputs  to  sensor  outputs 

These  “second-order  modes”  rankings  give  important  evaluations  about  which  modes  to 
retain  and  validity  of  a  actuator/sensor  placement.  These  rankings  are  shown  in  Figure  9 
along  with  LOS  modal  cost  [6]  computed  using  the  colored  noise  disturbance. 

Here  the  absolute  values  of  the  modal  costs  (for  the  VCOSS  1  model)  are  used.  The 
RMS  second-order  modes  and  modal  costs  are  plotted  versus  mode  number  in  Figure  9. 
Immediately  evident  is  the  clustering  of  these  modal  phenomena.  The  disturbance  eifect  as 
seen  through  the  line-of-sight  is  constrained  to  clusters  of  modes  as  is  the  ability  to  measure 
and  control  the  model.  The  coincidence  of  the  controllable  clusters  and  disturbable  clusters 
indicates  a  favorable  actuator /sensor  configuration  for  the  problem. 

2.10  Frequency  Weighted  Balanced  Realizations 

Balanced  realization  model  reduction  can  be  extended  to  finding  a  reduced  model  P„(s)  of 
a  high  order  model  P(s)  such  that 

sup? {W®(jw)(P(jw)  -  Pn(jw))Wi(j<jj))  <  1 

Ul 

where  W0(a)  and  W%(a)  are  output  and  input  frequency  dependent  weighting  matrices.  These 
can  be  chosen  to  reflect  closed-loop  requirements  on  model  error,  vis  a  vis,  frequency  domain 
stability  and  performance  margins.  For  example,  stability  of  the  closed  loop  system  with 
C(s)  designed  from  Pn(s)  is  guaranteed  if 

W0(s)  =/ 

W,(a)  =C„(s)|/  +  Pw(s)C„(e))-1. 


The  problem  is  that  W,(a)  is  dependent  on  Pn(a)  which  is  unknown.  The  let  out  is  that 
its  shape  is  partially  determined  by  the  performance  specifications,  thus,  we  can  make  an 
initial  guess.  This  technique  is  referred  to  as  "advanced  loop  shaping.”  This  involves  an 
iterative  problem  which  is  solvable  via  successive  approximation. 

2.11  Compensator  Order-Reduction 

An  alternative  to  plant  order  reduction  is  to  design  a  high  order  compensator  and  then  reduce 
the  compensator  order.  Let  C(s)  denote  a  high  order  compensator  of  order  N  designed  to 
control  /’(a)  of  order  N  or  larger.  Let  Cn(a )  denote  a  reduced  version  of  C(a)  of  order 
n  <  N.  Motivated  by  the  stability  robustness  theory,  view  C(s)  -  Cn(s)  as  a  perturbation. 
Hence,  the  closed  loop  system  with  P(a)  and  C„(s)  is  stable  if 

sup?  {W(jw)[C(joj)  -  Cn(ju)\)  <  1 

U) 

where  W (a)  =  (/  +  P(s)C(s))-1P(s). 

The  weight  W (s)  is  stable  because  the  high  order  control  C(s)  stabilizes  the  closed  loop 
system.  In  this  case  W  (s)  is  known  and  we  can  apply  internal  balancing  to  find  Cn(s).  The 
disadvantage  to  this  method  is  that  it  is  necessary  to  find  a  high-order  compensator.  The 
advantage  is  that  once  it  is  found,  internal  balancing  applies  immediately  since  the  weights 
are  known.  On  the  other  hand,  direct  plant  order  reduction  does  not  involve  control  design 
for  the  high  order  plant,  but  does  involve  an  iterative  process  since  the  weights  are  functions 
of  the  (unknown)  reduced  model. 

2.12  Low- Authority  Control  Design 

LAC  systems,  when  applied  to  structures,  are  vibration  control  systems  consisting  of  dis¬ 
tributed  sensors  and  actuators  with  limited  damping  authority.  The  control  system  is  allowed 
to  modify  only  moderately  the  natural  modes  and  frequencies  of  the  structure.  This  basic 
assumption,  combined  with  Jacobi’s  root  perturbation  formula,  leads  to  a  fundamental  LAC 
formula  for  predicting  algebraically  the  root  shifts  produced  by  introducing  a  LAC  structural 
control  system.  Specifically,  for  an  undamped,  open-loop  structure,  the  predicted  root  shift 
(d\„)p  is  given  by 

(dAn)p  *3  —  Cgr$gn4>r n  (l) 

^  a.r 

where  the  coefficient  matrix  Car  is  a  matrix  of  (damping)  gains,  and  $an,  <t>Tn  denote  respec¬ 
tively  the  values  of  the  nth  mode  shape  at  actuator  station  a  and  sensor  station  r. 

Equation  (1)  may  also  be  used  to  compute  the  unknown  gains  Car  if  the  dXn  are  con¬ 
sidered  to  be  desired  root  shifts  or,  equivalently,  desired  modal  dampings.  While  an  exact 
"inversion”  of  equation  (I)  does  not  generally  exist,  weighted  least-squares  type  solutions 
can  be  devised  to  determine  the  actuator  control  gains  Car  necessary  to  produce  the  required 
modal  damping  ratios.  This  determination  of  the  gains  is  the  synthesis  of  LAC  systems. 

For  structures  which  already  have  some  damping  or  control  systems  in  which  sensor, 
actuator,  or  filter  dynamics  can  either  be  ignored  or  are  already  embedded  in  the  plant 
dynamics,  the  root  perturbation  techniques  and  cost  function  minimization  methods  above 
can  similarly  be  used  to  synthesize  low-authority  controls. 

2.13  Robustness  of  LAC  Systems 

When  sensors  and  actuators  are  colocated  (i.e.  a  =  r),  are  complementary,  and  only  rate 
feedback  is  used,  formula  (1)  reduces  to 


Figure  10:  LQG  Control  With  Frequency-Shaping  Filters  . 


which  shows  that  the  root  shifts  are  always  towards  the  left  of  the  jf-axis  if  all  the  gains 
are  negative.  This  robustness  result  is  obviously  based  on  the  assumption  that  both  sensors 
and  actuators  have  infinite  bandwidth,  and  also  that  the  structure  was  initially  undamped. 
Several  departures  from  this  idealization  occur  in  the  actual  practical  implementation  of 
the  LAC  systems.  The  most  severe  of  these  results  from  the  finiteness  of  the  actuators’ 
bandwidths.  More  precisely,  the  second-order  roll-off  introduced  by  the  actuator  dynamics 
will  always  destabilize  an  undamped  structure.  However,  when  some  natural  damping  is 
present  in  the  structure,  or  when  a  passive  damper  is  mounted  in  parallel  with  the  actuator, 
additional  active  damping  can  be  obtained  without  destabilizing  the  structure. 

2.14  High-Authority  Control  Design 

The  HAC  control  design  procedure  can  be  based  on  any  number  of  multivariable  design 
methods,  e.g.  LQG,  Q-parametrization,  Hoc-optimization,  etc.  Increased  penalties  in  the 
LQG  cost  functional  are  placed  at  those  frequencies  where  less  response  is  desired.  The 
concept  of  frequency-shaped  cost  functionals  was  introduced  prior  to  ACOSS  [7], 

The  frequency  shaping  methods  are  useful  in  several  areas  of  large  space  structures 
control.  Three  principal  applications  are  important:  (1)  robustness  (spillover  avoidance), 
(2)  disturbance  rejection,  and  (3)  state  estimation. 

2.15  Management  of  Spillover 

Spillover  in  closed  loop  control  of  space  structures  is  managed  by  injecting  minimum  control 
power  at  the  natural  frequencies  of  the  unmodeled  modes.  Procedures  for  controlling  spillover 
at  high  frequencies  are  usually  discussed,  although  similar  techniques  are  applicable  for  other 
regimes. 

The  high  frequency  spillover  may  be  controlled  by  modifying  the  state  or  the  control 
weighting.  Conversion  to  the  frequency  domain  gives  the  following  performance  index: 


The  problem  of  robustness  (spillover  management)  is  solved  by  making  Q  and  R  functions 
of  frequency.  Figure  10  depicts  the  modification  to  the  nominal  LQG  controller.  Observe  that 
frequency  shaping  adds  filters  whose  inputs  are  the  innovation  outputs  of  the  state-estimator 
in  the  LQG  controller. 

2.16  Summary 

The  application  of  frequency-shaping  methods  to  large  space  structures  leads  to  a  linear  con¬ 
troller  with  memory.  However,  additional  states  are  needed  to  represent  frequency-dependent 
weights,  hence,  there  is  an  increase  in  the  controller  order.  The  software  needed  for  these 
controller  designs  is  similar  to  that  for  standard  LQG  problems. 


Figure  11:  The  Unity- Feed  back  Structure. 


3.  Controller  Design  Using  Q-Parametrization  and  Hoo  Op¬ 
timization 

During  the  last  decade,  mathematical  theories  of  servo  design  have  been  based  mainly  on 
quadratic  minimization  of  the  Wiener-Hopf-Kalman  type,  usually  applied  to  state-space 
models,  e.g.  LQG  controls.  However,  despite  the  academic  success  of  these  methods,  clas¬ 
sical  frequency  response  techniques  relying  on  “lead-lag  compensators”  to  reduce  sensitivity 
have  continued  to  dominate  industrial  servo  design.  One  reason  is  that  quadratic  design 
tends  to  have  poor  sensitivity.  On  the  other  hand,  the  frequency  domain  description  has 
proven  to  be  more  suitable  to  characterize  uncertainties  which  arise  in  the  plant  approxima¬ 
tion/identification,  and  frequency  domain  technique  usually  results  in  more  robust  design, 
e.g.  frequency-shaped  LQG  can  be  viewed  as  an  indirect  frequency-domain  design  approach. 

Two  direct  multivariable  frequency  domain  design  techniques  have  become  popular  in 
recent  years:  the  Q-parametrization  technique  and  the  Poo-optimal  sensitivity. 


3.1  Q-Parametrization  Design 

Consider  the  linear  unity-feedback  systems  shown  in  Figure  11  where  P(s )  is  the  given  linear 
time-invariant  plant.  C(s)  is  the  linear  compensator,  uj  is  the  reference  input,  112,  and  do  are 
respectively  the  plant-input  disturbance  and  plant-output  disturbance,  and  y2  is  the  plant 
output. 

The  closed  loop  system  input-output  transfer  function  is  given  by 


Hvu  = 


c{i  +  pc)~  1  -c{i  +  pc)-1p  -c(i  +  pcyl 
PC(/  +  PC)-1  p\i  +  cp)~ 1  (/  +  PC)-1 


(For  simplicity,  we  drop  the  argument  a  in  P(s),C(s)  etc.  in  this  section.) 
By  introducing  the  parameter  (transfer  function) 


H 


yu  — 


«1 


U2 

d„ 


yi 

y2 


Hyu  can  be  rewritten  as 


H 


yu  — 


Q  =  C(I  +  PC)~\ 

q  —QP  -Q 
PQ  (/  -  PQ)P  I-PQ 


Note  that  the  closed  loop  input-output  transfer  function,  for  the  given  plant  P,  is  com¬ 
pletely  specified  by  the  parameter  Q  in  a  very  simple  manner:  it  involves  only  sums  and 
products  of  P  and  Q 

In  a  typical  control  system  design  problem,  the  two  most  important  closed  loop  transfer 
functions  are  //y,u,  and  Hyjju:  //y,u,  •«  the  transfer  function  from  reference  input  uj  to 
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output  tft  and  HVi,t0  is  the  transfer  function  from  plant-output  disturbance  d„  to  output  yj. 
They  specify  respectively  the  servo-performance  and  regulator  performance  of  the  feedback 
system  S.  The  two  transfer  functions  are  given  by 

HyaUl  =  PQ 

and 

ffyjdo  =  l  ~ 

Therefore  the  control  design  problem  reduces  to  choosing  the  parameter  Q  so  that  the  closed 
loop  system  S  is  stable  and  that  HyaU ,  and  Hyij0  are  “satisfactory” .  After  the  parameter  Q 
is  chosen,  the  corresponding  compensator  C  can  be  obtained  by  the  formula 

C  =  Q{I  -  PQ)~l 

Hence,  there  is  a  one-to-one  correspondence  between  C  and  Q.  Consequently,  for  each 
parameter  Q  chosen,  there  is  a  unique  compensator  C  which  achieves  the  specified  Q. 

The  selection  of  the  parameter  Q  in  the  design  process  raises  several  questions:  What  are 
the  conditions  on  Q  so  that  the  resulting  compensator  C  is  realizable  (e.g.  proper)?  What 
is  the  class  of  all  Q's  which  result  in  a  stable  feedback  system?  How  is  an  “optimal”  Q  chosen? 

Realizability:  If  the  plant  P  is  realizable,  then  the  compensator  C  is  realizable  if  and  only 
if  the  parameter  Q  is  realizable.  Note  that  a  physical  plant  is  always  realizable. 

Global  Parametrization:  If  the  open  loop  plant  P  is  stable,  then  the  closed  loop  system 
5  is  stable  if  and  only  if  Q  is  stable,  since  sums  and  products  of  stable  transfer  function 
matrices  are  stable.  Consequently,  the  class  of  all  stabilizing  compensators  is  given  by 

{Q(I  -  PQ)-1 1  Q  is  stable} 

and  the  class  of  all  achievable  stable  input-output  transfer  matrix  Hya Ul  and  the  class  of  all 
achievable  stable  disturbance-to-output  transfer  matrix  Hyai„  are  given  respectively  by 

{PQ\  Q  is  stable}  and 

{I-PQ\  Q  is  stable.} 

These  sets  give  global  parametrization  of  all  stabilizing  compensators,  and  all  achievable 
I/O  characteristics  in  terms  of  a  stable  proper  transfer  matrix  Q.  In  other  words,  the  class 
of  all  “feasible”  designs  are  parametrized  by  Q. 

If  the  open  loop  plant  P  is  not  stable,  additional  constraints  have  to  be  added  to  the 
choice  of  Q,  in  addition  to  stability  and  realizability  of  Q.  For  example,  Q  must  contain 
right  half  plane  zeros  to  cancel  the  unstable  poles  of  P.  Currently,  there  are  three  approaches 
to  obtain  global  parametrization  of  a  given  unstable  plant:  (i)  Factorization  representation 
theory  [8];  (ii)  Direct  approach  [9];  (iii)  Two-step  compensation  [9]. 

Optimality:  The  Q-parametrization  alone  does  not  quantatively  address  the  issue  of  op¬ 
timal  design.  The  designer  selects  Q  from  the  class  of  “feasible”  designs,  on  the  basis  of 
the  desired  input-output  response,  a  priori  knowledge  of  external  disturbances,  bandwidth, 
dynamic  range  and  uncertainty  of  the  plant,  etc. 

Optimal  design  based  on  the  Q-parametrization  and  fractional  representation  framework 
has  become  very  popular  in  the  research  community.  The  ffoo-optimal  sensitivity  design  is 
among  the  results  available. 

3.2  //<*,- Optimal  Sensitivity  Design 

The  ffoo-optimal  sensitivity  design  is  an  extension  of  the  Q-parametrization  technique  to 
include  a  quantitative  performance  measure  of  the  closed  loop  system  and  achievable  op¬ 
timality  based  on  the  performance  measure.  Roughly  speaking,  the  //»  design  problem  is 
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Figure  12:  Adaptive  Control  System  . 


the  following:  Given  an  open  loop  plant  P(s)  and  a  low  pass  weighting  function  JV(s),  find 
the  compensator  C(s)  so  that  the  /foo-norm  of  the  weighted  sensitivity  (/  +  PC)~lW  is 
minimized  subject  to  the  stability  of  the  closed  loop  system. 

Using  the  Q-parametrization  formulation,  the  problem  is  equivalent  to  the  following: 
Find  a  Q  in  Hoo  such  that  the  closed  loop  system  is  stable  and  that  (/  -  PQ)W  is  mini¬ 
mized.  Since  the  weighted  sensitivity  function  is  affine  in  Q,  the  equivalent  problem  is  easier 
to  solve  than  the  original  problem. 

Solution  to  the  Hoo-Optlmal  Sensitivity  Problem:  Based  on  the  fractional  represen¬ 
tation  (coprime  factorization)  formulation,  several  solutions  have  been  proposed  and  algo¬ 
rithms  given.  However,  all  the  proposed  algorithms  are  conceptual  in  nature,  suitable  only 
for  simple  text  book  example.  More  effort  is  needed  towards  a  numerically  robust  synthesis 
procedure. 


4.  Adaptive  Control  Techniques 

Uncertainties  in  both  disturbance  spectra  and  system  dynamical  characteristics  will  limit 
the  performance  obtainable  with  fixed  gain,  fixed  order  controls.  The  use  of  adaptive  type 
control,  where  disturbance  and/or  plant  dynamics  are  identified  prior  to  or  during  con¬ 
trol,  gives  system  designers  more  options  for  minimizing  the  risk  in  achieving  performance 
benchmarks.  For  the  case  of  LSS  systems  where  performance  levels  are  extremely  high,  it 
is  absolutely  necessary  that  disturbance  and  plant  models  be  equally  accurate.  Since  data 
from  ground  tests  do  not  usually  represent  the  flight  condition  accurately,  it  follows  that  an 
on-line  procedure  for  identification  and  control  is  necessary. 

The  need  to  identify  modal  frequencies,  for  example,  in  high  performance  disturbance 
rejection  systems  has  been  shown  in  ( l] .  Tl.  *  deployment  of  high  performance  optical  or  RF 
systems  may  require  on-line  identification  of  critical  modal  parameters  before  full  control 
authority  can  be  exercised.  Parameter  sensitivity,  manifested  by  performance  degradation 
or  loss  of  stability  (poor  robustness)  may  be  effectively  reduced  by  adaptive  feedback  mech¬ 
anizations. 

Most  adaptive  control  algorithms  can  be  described  in  the  form  shown  in  Figure  12.  For 
example,  one  could  select  from  the  folowing  catalogs  of  major  areas. 
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The  schemes  also  differ  in  terms  of  update  rates.  Typically  the  outer  control  loop  is  at  a  fast 
rate,  whereas  the  parameters  from  identification  are  updated  more  slowly.  Adaptive  schemes 
are  referred  to  as  recursive  if  the  identification  rate  is  a  fixed  multiple  of  the  controller  rate. 
If  identification  is  used  when  necessary  for  calibration  the  scheme  is  referred  to  as  adaptive 
calibration. 

Although  a  great  deal  of  research  results  are  available  about  adaptive  control  and  iden¬ 
tification,  unmodeled  dynamics  and  broadband  disturbances  will  significantly  upset  most 
algorithms. 

4.1  Adaptive  Calibration: 

The  use  of  a  “slow”  adaptive  control,  which  is  more  practical  than  recursive  adaptive  control 
in  most  space  applications  is  described  in  this  section.  It  is  referred  to  as  a  method  of 
adaptive  calibration.  The  term  “slow”  means  that  there  is  sufficient  time  to  run  batch 
identification  before  the  control  system  is  modified.  The  methodology  provides  a  guaranteed 
level  of  performance  given  an  “identified”  model  of  the  system  together  with  the  model 
error  between  the  system  and  the  identified  model.  In  fact,  the  methodology  generates 
performance  versus  model  error  tables  (to  be  stored  in  the  computer)  from  which  the  control 
design  is  immediately  obtained.  Moreover,  the  order  of  the  control  design  is  determined 
strictly  on  the  basis  of  model  error  and  performance  demand. 

4.2  Application  of  Adaptive  Calibration: 

The  basic  problem  with  control  based  on  identified  models  is  that  without  a  measure  of 
model  error  it  is  very  easy  to  destabilize  the  system  •  particularly  when  the  goal  is  high 
performance  -  as  in  LSS  systems.  Adaptive  calibration  is  an  approach  which  incorporates  a 
measure  of  model  error  with  robust  control  design  in  an  iterative  way  so  that  identification 
is  performed  only  where  it  is  needed.  A  proposed  adaptive  calibration  system  is  shown  in 
Figure  13  with  test  results,  using  the  CSDL  #2  model,  shown  in  Figure  14.  The  adaptive 
calibration  procedure  involves  the  following  Bteps: 

1.  The  model  M(s)  is  a  10-mode  model  which  has  been  obtained  from  I/O  data. 

2.  Estimate  S(w)  =  model  error  versus  frequency  using  FFT.  This  is  the  dashed  curve  in 
Figure  15. 

3.  Using  the  identified  model  M(s)  and  the  model  error  6(u>),  synthesize  a  robust  control 
(section  2). 

4.  Calculate  S,m  -  stability  margin.  This  is  the  dashed  curve  in  Figure  15.  Compare  to 
model  error  6  both  plotted  in  Figure  16.  If  acceptable  go  to  Step  6  and  implement 
controller.  Otherwise  go  to  Step  5. 

5.  Modify  filter  windows,  number  of  parameters  (e.g.  number  of  modes),  or  input  spec¬ 
trum  and  then  repeat  Step  1  to  obtain  new  ID  model.  Figure  16  shows  result  of 
identification  after  one  mode  is  added  in  the  frequency  domain  region  where  the  test 
fails. 

6.  Implement  controller. 
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CSDL  MOOEL  NO.  2 
Figure  14:  Draper  Simulation  System 


FREQUENCY  (RAO /SEC) 

Figure  16:  Data  Before  and  After  ID  Cycle  Plots  Show  Where  Error  is  Margin 
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Abstract 

A  summary  of  methods  of  averaging  analysis  is  presented  for  continuous-time  adaptive 
systems.  The  averaging  results  of  Riedle  and  Kokotovic  [1]  and  of  Ljung  [2]  are  examined 
and  are  shown  to  be  closely  related.  Both  approaches  result  in  a  sharp  stability-instability 
boundary  which  can  be  tested  in  the  frequency  domain  and  interpreted  as  a  signal  dependent 
positivity  condition. 

1.  Introduction 

For  a  large  class  of  adaptive  systems,  as  well  as  for  some  output  error  identification  schemes, 
a  stability  analysis  in  the  neighborhood  of  the  desired  behavior  leads  to  investigating  the 
stability  of  the  linearized  adaptive  system  described  by  an  equation  of  the  form, 

0  =  (1) 

where  0(t)eZp  is  the  adaptation  parameter  vector,  z(t)eZp  is  the  regressor,  and  e  >  0  is 
the  adaptation  gain.  The  theory  developed  in  [3,4]  shows  that  the  stability  of  adaptive 
systems  in  the  neighborhood  of  the  equilibrium  trajectories  is  dependent  on  the  stability  of 
this  system  of  linear  time-varying  equations.  System  (1)  for  example,  can  be  obtained  as 
a  result  of  linearization  of  the  adaptive  system  in  the  neighborhood  of  a  “tuned”  system, 
i.e.,  a  system  where  the  adaptive  parameters  are  set  to  a  constant  value  0.eZp  and  whose 
behavior  is  deemed  acceptable.  Hence,  in  (1),  8(t)  is  the  vector  of  parameter  errors  between 
the  parameter  estimate  at  time  t  and  the  tuned  value  (?»,  z(t)  is  the  regressor  vector  from  the 
tuned  system  (e.g.,  filtered  revisions  of  measured  signals),  and  the  scalar  c  is  the  magnitude 
of  the  adaptation  gain  which  essentially  controls  the  rate  of  adaptation.  The  operator  H 
depends  on  the  actual  system  being  controlled  or  identified  and  also  on  the  tuned  parameter 
setting  0.. 

It  is  shown  in  [3]  that  if  the  zero  solution  of  (1)  is  uniformly  asymptotically  stable  (u.a.s.), 
then  the  adaptive  system  is  locally  stable,  i.e.,  the  adaptive  system  behavior  will  remain  in  a 
neighborhood  of  the  desired  behavior  provided  the  initial  parameter  error  0(0)  and  the  effect 
of  external  disturbances  are  sufficiently  small.  Although  these  results  were  arrived  at  using 
input-output  properties,  local  stability  properties  can  also  be  obtained  from  the  results  on 
“total”  stability  [5]. 

1.1  Unmodeled  Dynamics  and  Slow  Adaptation 

In  the  ideal  case  there  are  a  sufficient  number  of  adaptive  parameters  (the  number  p)  such 
that  the  tuned  parameter  setting  results  in  H{s)  being  strictly  positive  real  ( SPR ),  i.e., 

H(s)  proper  and  stable,  and  Re  H(jvj)  >  0,Vwc£+.  Under  these  conditions,  we  have  the 
following  known  results:  (i)  the  zero  solution  of  (1)  is  stable,  i.e.,  0(t)  is  bounded  but  not 
necessarily  constant;  (ii)  if,  in  addition,  z(t)  is  persistently  exciting,  then  the  zero  solution 
is  u.a.s.,  thus,  0(t)  —♦  0  exponentially  fast  as  t  — »  oo.  The  trouble  starts  when  there  are  an 


insufficient  number  of  parameters  to  obtain  II(s)tSPR,  as  is  the  case  in  adaptive  control 
when  the  plant  has  unmodeled  dynamics. 

In  this  paper  we  will  examine  the  stability  of  (1)  when  c  is  small,  z(t)  is  persistently 
exciting,  and  H(s)  is  not  necessarily  SPR  but  only  stable.  We  will  refer  to  this  case  as  slow 
adaptation. 

1.2  Approaches  Based  on  Averaging 

In  a  recent  paper  by  Riedle  and  Kokotovic  [1],  a  classical  method  of  averaging  as  described 
by  Hale  [6]  was  applied  to  the  linearized  adaptive  system.  The  result  is  a  sharp  stability- 
instability  boundary  determined  by  a  signal  dependent  positivity  condition  which  asserts 
that  the  zero  solution  of  (l)  is  u.a.s.  if 

M£WwMw)*]*e  (»)  >  0  (2) 

w<n 

where  fl  and  {a(w),wcn}  are,  respectively,  the  Fourier  exponents  and  coefficients  of  u(t). 
Condition  (2)  can  be  considered  as  a  signal  dependent  positivity  condition,  but  unlike  the 
SPR  condition  Re  H(jw)  is  not  required  to  be  positive  at  all  frequencies.  Thus,  this 
result  is  significantly  weaker  than  the  SPR  condition  required  in  the  proof  of  stability  of 
adaptive  systems,  e.g.,  [7,8] .  In  order  to  apply  the  averaging  theory  to  obtain  this  result,  the 
linearized  system  has  first  to  be  decoupled  into  slow  (parameter)  states  and  fast  states.  It  is 
this  transformation  which  is  essential  to  the  averaging  approach  and  is  a  major  contribution 
in  the  Riedle-Kokotovic  method. 

Averaging  has  also  been  applied  to  the  counter-ex  ample  of  Rohrs  et  al.  [9]  by  Astrom 
[10,11].  In  this  analysis,  by  “freezing”  the  parameters,  the  parameter  and  state  equations  are 
decoupled  thereby  obtaining  the  asymptotic  trajectories.  Both  of  these  averaging  analyses 
assume  that  the  system  is  periodic  or  almost  periodic,  an  assumption  that  can  be  dispensed 
with  by  introducing  the  notion  of  a  sample  average  [12]. 

In  [13],  the  averaging  approach  is  extended  to  nonlinear  systems  by  introducing  the 
integral  manifold  which  completely  separates  the  parameter  and  state  equations.  This  latter 
approach  is  valid  for  the  nonlinear  adaptive  system,  and  not  just  the  linearized  part.  Related 
results  can  also  be  found  in  [14]. 

Averaging  methods  for  adaptive  systems  have  appeared  in  earlier  work,  the  most  notable 
of  these  being  the  averaging  method  developed  by  Ljung  [2]  for  use  in  discrete-time  recursive 
parameter  estimation.  The  analysis  shows  that  the  convergence  properties  of  the  estimates 
can  be  determined  from  the  stability  properties  of  a  related  set  of  ordinary  differential 
equations;  the  method  usually  referred  to  as  the  ODE  analysis. 

In  this  paper  we  summarize  the  results  obtained  by  Riedle  and  Kokotovic  [1,13]  and 
show  (heuristically)  how  they  are  related  to  the  local  stability  analysis  in  [3,4]  and  the  ODE 
averaging  approach  of  Ljung  in  [2]. 

2.  Adaptive  Error  System 

Although  it  is  unlikely  that  a  truly  generic  adaptive  error  system  can  be  formed  to  capture 
all  the  nuances  of  adaptive  systems,  the  SISO  adaptive  system  shown  in  Figure  1  is  offered 
as  a  good  representation  for  the  purposes  of  analysis.  The  system  equations  are: 


2 

a 

1 

• 

II 

(3a) 

z  =  z.  -  Hzu  v 

(36) 

v  =  z'O 

(3c) 

0  =  efz 

(3d) 

The  development  of  (3)  can  be  found  in  [15,16]  and  in  [17],  In  (3),  e(t)  €  R.  is  a  measured  error 
signal  which  drives  the  parameter  update  (3d),  z(t)  C  is  the  regressor,  and  0(t)  G  £p  is 
the  parameter  error  between  the  current  estimate  at  t  and  a  tuned  parameter  setting  0.  C  R.r. 


Figure  1:  Adaptive  error  system 


The  selection  of  0 ,  is  based  on  complete  knowledge  of  the  actual  plant  and  disturbances.  The 
system  corresponding  to  this  setting  is  referred  to  as  the  tuned  system.  The  signals  e.(t)  E  Z 
and  z,(t)  €  Zp  are  outputs  of  the  tuned  system,  and  are  referred  to  as  the  tuned  error  and 
tuned  regressor,  respectively.  The  signal  v(t)  E  Z  can  be  regarded  as  the  adaptive  control 
error. 

The  operators  Htu  and  Hzv  are  dependent  on  0.  and  describe  how  v  effects  the  error  and 
regressor  signals.  We  assume  here  that  Heu  and  Htv  are  linear-time-invariant  ( LTI )  with 
stable  proper  transfer  functions  Hev(s)  and  H,u(s).  This  would  arise,  for  example,  when  the 
plant  to  be  controlled  is  LTI  and  the  adaptive  controller  is  linear  in  the  adaptive  parameters. 
The  stability  of  Htv  and  Hzu  is  a  consequence  of  the  definition  of  9 ,  as  the  tuned  parameter 
setting. 

The  operation  T  depends  on  the  choice  of  parameter  update  algorithm.  We  will  restrict 
attention  here  to  the  following  representatives: 

Gradient 

(r*)(f)  =  ez(t) 

£  >  0 

Recursive  Least  Squares 

(r*)(0  =  P(t)z(t) 

iP-'i  0  =  *(*)*(«)' 
p(  o)  =  p(oy  >  o 

3.  Global  Stability  and  Passivity 

It  is  of  interest  to  determine  under  what  conditions  the  adaptive  error  system  (3,5)  produces 
bounded  outputs  (fl,e,v,z)  for  all  bounded  initial  parameter  errors  9(o)  E  Zp.  This  is  what 
is  meant  here  by  “global*  stability.  As  it  turns  out,  it  is  possible  to  prove  such  a  result 
provided  that: 

(')  Htv{s)  €  SPR  with  gradient 

(ii)  Htv{s)  -  |  E  SPR  with  least  squares 

(iii)  z.,i,  E  Lp  and  cither 


(4) 

(5) 


(6) 

(7) 

(8) 


b)  e.,e,  €  L <*,  and 

z  €  PE  (persistently  exciting)  (10) 

Parameter  convergence  to  a  constant  in  Rp  or  to  a  well  defined  subset  in  Zp,  requires  that 
(9)  be  strengthened  to: 

e.,e.  €  L2nLoo,  z.  €  PE  (11) 

The  above  results  can  be  found  in  [7,15,16]  and  in  [18] .  Although  of  theoretical  significance, 
they  are  not  feasible  to  obtain  in  practice.  In  the  first  place,  due  to  unmodeled  dynamics  [9], 
Hev(s)  €  SPR  is  practically  impossible  to  achieve  in  adaptive  feedback  and  even  in  some 
output  error  identification.  (This  is  not  the  case  in  equation  error  identification.)  Secondly, 
when  e,,€  Loo  as  in  (10),  it  is  required  that  z  €  PE  which  cannot  be  guaranteed  in  advance 
since  z  is  inside  the  adaptive  loop.  Case  (11)  which  requires  z.  €  PE  -  which  is  feasible  to 
establish  -  conflicts  with  e, ,  e.  €  L2  n  L «, .  The  latter  implies  e,  ( t )  — »  0  which  can  only  occur 
for  z.  G  PE  -  and  where  there  are  no  unmodeled  dynamics  which  we  argue  is  not  possible. 

With  these  impossible  to  satisfy  theoretical  requirements,  it  is  doubtful  that  a  global  sta¬ 
bility  theory  can  be  attained  which  relies  on  passivity,  i.e.,  condition  (6,7).  On  the  practical 
side,  however,  there  is  substantial  evidence  of  well  engineered  algorithms  that  work  without 
SPR  [10].  These  do  not  work  for  all  0(o)  and  for  all  e,,z,  in  L but  rather,  for  restricted 
magnitudes  and  signal  spectrums.  For  example,  if  Hev(s)  is  SPR  for  w  <  ub%u  then  it  is 
expected  that  the  adaptive  system  will  be  well  behaved  provided  there  is  insignificant  exci¬ 
tation  above  u>Bw  The  following  example  illustrates  some  of  this  phenomena. 


Example:  Consider  the  model  reference  adaptive  control  (MRAC)  system  studied  by  Rohrs 
et  al.  [9]  with  plant 

„/  x  2  229 

P(s)  = 


reference  model 


a  +  1  (a +15)2  +  4 
3 


Href(s)  = 


8  +  3 


and  adaptive  control  law 

u  =  -0  iy  +  d2r 

The  adaptive  parameters  are  obtained  from  the  gradient  algorithm, 

?x  =  ye 

=  -re 

e  =  y  -  H„fr 

For  this  example  we  have  the  tuned  error  given  by 

e.  =  H„r 

with 

_ 3 

erW  s3  +  31a1  +  259s  +  229(1  +  20.0  s  +  3 

We  also  have 

H  =  _ 158 _ 

“'V  1  a3  +  31a*  +  259s  +  229(1  +  2 0.x) 

Observe  that  H„(s)  and  Htv(s)  are  stable  provided  that 

0.x  G  (0,17.03) 

Since  H,v(s)  has  a  relative  degree  of  three,  it  follows  that  Hev(s)  is  not  SPR,  and  so  global 
stability  is  not  guaranteed. 

Figure  2  shows  0|(t)  vs.  0j(t)  for  simulations  corresponding  to  two  selected  inputs: 


Figure  2:  Parameter  drift  to  inputs  /?!  and  R2 
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Figure  3:  Blow-up  of  drift  to  input  Rl  . 


ri(t)  =  1  +  sin  3t 
rz(t)  =  1  +  tin  51 


with  initial  conditions  0i(O)  =  .65, 82  =  1.15  which  satisfy  the  DC  tracking  requirement.  The 
response  to  rj(t)  undergoes  a  transient  and  then  drifts  down  a  line  in  Zi  to  an  apparent 
stable  orbit.  Figure  2  shows  a  blow-up  near  the  stable  orbit  as  well  as  a  trajectory  which 
starts  just  below  it  and,  drifts  upward.  The  response  to  rj(t),  however,  is  unstable  in  the 
sense  that  the  parameters  continue  to  drift  and  eventually  $i[t)  will  exceed  17.03  and  the 
system  becomes  unstable. 

Most  adaptive  control  systems  show  the  characteristic  behavior  illustrated  in  our  exam¬ 
ple.  The  parameter  first  exhibit  a  transient  followed  by  a  steady-state  drifting.  The  papers 
by  Astrom  [10,11]  contain  many  examples.  In  the  example  here,  the  drifting  appears  to 
occur  along  a  line  in  £*.  In  one  case  (input  ri)  the  drift  stops  and  the  parameters  settle 
into  a  periodic  orbit.  With  an  apparently  modest  change  in  the  input  spectrum  {ri)  the 
parameters  now  drift  into  the  instability  region.  Therefore,  either  the  orbital  center  has 
drastically  changed  and  is  now  outside  the  constant  parameter  stability  set,  or  else  there  is 
no  stable  orbit  at  all  anywhere  along  the  £*  line  of  drift. 
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Figure  4:  Equivalent  feedback  representation  of  adaptive  error  system. 


In  the  forthcoming  sections  we  will  establish  conditions  under  which  the  qualitative 
properties  of  the  drifting  phenomena  can  be  predicted  under  slow  adaptation.  Our  analysis  is 
local  and  based  on  the  classical  methods  of  linearization  and  averaging  for  nonlinear  systems. 


4.  Local  Stability:  Small  Gain  Theory  and  Averaging 

Another  way  to  view  the  system  (3)  under  ideal  conditions  (6)-(  1 1)  is  to  arrange  the  system  in 
the  form  shown  in  Fig.  4.  Here,  the  forward  path  operator  is  defined  by  the  map  N  =  z  —*  v 
such  that 


z  =  zm  +  z  (12a) 

6  =  (\ze.  -  zHtu{z'0)\  (126) 

u  =  z'O  (12c) 

with  z  obtained  from  the  feedback  path, 

z=-H,v(4>'e)  (13) 

Observe  that  N  is  in  effect  the  linear  adaptive  system  that  we  mentioned  earlier.  Clearly  z 
is  the  amount  by  which  the  regressor  z(t)  differs  from  the  tuned  regressor  z,  (t). 

Now,  let  T  :  z  — *  \  denote  the  loop-gain  operator  defined  by  (12)  and 

l  =  -H'M'O)  (14) 

Small  Gain  Theory  asserts  [19]  that  if,  for  some  p  €E  [l,  oo],  the  Lp-gain  of  T  is  less  than  one, 
then  the  system  is  Lp-stable.  Let  7 P(T)  denote  the  Lp-gain  of  T,  i.e., 

7p(T)  =  inf{k  :  36  >  0  s.t.||Tz||p  <  fc|jz||p  +  6,Vz  6  Lp}  (15) 


when  p  —  2,  it  is  possible  to  show  that  for  all  z  €  Li, 

IMI*  <  ‘|*(o)| 

where  c  is  a  constant  independent  of  z  [16].  Hence, 

\\Ti\\t  =l|ff«w||  2 

<  72(ff*„)c|0(O)| 


(16) 


(17) 


Comparing  this  result  with  the  definition  of  gain  (15),  we  see  that  72(T)  =  0.  Thus,  under 
ideal  conditions,  the  loop-gain  of  the  adaptive  error  system  is  zero ! 

Now,  suppose  that  Hev(s)  is  not  SPR  and  e,(t)  /  0.  One  would  expect  that  small 
deviations  in  the  SPR' ness  of  Htv{s)  and  small  non-zero  magnitudes  of  e,{t)  could  be  toler¬ 
ated  without  trouble.  Unfortunately,  this  is  not  quite  the  case.  In  the  first  place  (16)  holds 
without  persistent  excitation.  This  means  that  system  N  (12)  is  only  uniformly  stable  (in 
the  sense  of  Lyapunov).  Recall  that  uniform  stability  is  not  robust  to  typical  perturbations. 
Uniform  asymptotic  stability  of  N  (equivalently  exponential  asymptotic  stability,  since  N 


1, 


is  linear)  is  robust  to  a  large  class  of  perturbations.  Thus,  a  basic  idea  behind  the  use  of 
various  forms  of  linearization  theorems  in  the  analysis  of  adaptive  systems,  is  to  insure  that 
the  system  N  is  u.a.s.  (uniformly  asymptotically  stable)  which  necessitates  that  z.(t)  be 
persistently  exciting.  Since  space  limitations  do  not  permit  us  to  elaborate  on  linearization 
theory  here,  the  interested  reader  is  referred  to  [3,4].  We  will,  however,  see  that  averaging 
imposes  a  natural  linearization. 

By  restricting  the  magnitude  of  9(o)  and  the  magnitude  and  spectrum  of  z,(t)  and  e.(t), 
it  is  possible  to  obtain  conditions  to  prove  local  stability  [3,4].  The  local  stability  property 
hinges  on  two  premises:  (i)  the  error  system  trajectories  are  in  a  (not  necessarily  small) 
neighborhood  of  the  tuned  solution,  and  (ii)  the  linear  time  varying  system  which  maps 
w  — *  9  as  given  by 

9  =  -(r  *.)(0*«W(-M-))  +  (ru>)(0  (is) 

is  Loo-stable,  i.e.,  there  exists  constants  k  and  6  s.t.||0j|oo  <  k|MI<»  +  b.  The  choice  of  F 
comes  from  (4)  or  (5)  and  z*(t)  is  the  tuned  regressor.  We  can  regard  (18)  as  a  linearization 
of  the  update  algorithm.  There  are  several  ways  to  establish  the  Loo  stability  of  (18). 

4.1  Gradient  Algorithm 

We  first  consider  the  case  when  T  represents  the  gradient  algorithm,  i.e.,  (rz,)(t)  =  ez,(t) 
with  e  >  0. 

In  [20],  it  is  shown  that  if  Hev(s)eSPR  and  z,ePE,  then  for  all  e  >  0,w  — *■  9  is  exponen¬ 
tially  stable,  and  hence,  Loo-stable.  In  [21],  if  Hev(s)  =  Heu(s)  +  sA(s),  Heu(s)eSPR,  A (s)  is 
stable,  and  z,ePE  then  for  sufficiently  small  e  and  ||z.[|oo»  w  -*  9  is  still  exponentially  stable, 
and  hence  is  Loo-stable.  This  latter  method  relies  on  loop-transformations  and  application 
of  small  gain  theory. 

Another  approach  is  to  use  averaging.  In  [l]  it  is  shown  that  if  z»fPE  with  the  Fourier 
series  representation 

z*(0  ~  53a(w*)e/“*<  (19) 

* 

and  if  the  eigenvalues  of  the  real  matrix 

B  =  Z^Q(w*)a(_w*Vtf«v(_^w*)  (20) 

k 

all  have  positive  real  parts,  then  for  all  sufficiently  small  c  >  0,u;  — ♦  9  is  exponentially  stable, 
and  hence,  Loo-stable.  Moreover,  if  any  one  eigenvalue  of  B  has  a  negative  real  part,  then 
iv  — *  9  is  exponentially  unstable.  Hence,  there  exists  u>fLooS-t-|0(OI  °o  as  t  — ►  oo  expo¬ 
nentially  fast.  It  is  obvious  then  when  Heu(s)  is  not  SPR,  but  only  approximately  so,  then 
the  Riedle-Kokotovic  result  provides  a  sharp  stability-instability  boundary.  Note  that  when 
Hlv(a)  is  SPR  and  z.iPE  we  have  from  [20]  that  to  — ♦  9  is  exp.  stable  for  all  e  >  0.  At  the 
present  time,  averaging  theory  as  applied  here,  does  not  hold  for  all  c  >  0  even  when  Hev(s) 
is  SPR.  On  the  other  hand,  the  result  in  [21]  remains  valid  for  ifew(s)cS P R(A(s)  =  o) 
because  then  c  >  0  is  bounded  above  by  infinity. 

Example  In  this  example  we  illustrate  what  happens  when  ReX(B)  >  0  but  e  is  too  large. 
Consider  the  scalar  system 

9  =  -(z,lltv(z,9) 

with  z,(t)  —  sin(.35t)  and  Htv{s)  =  l/(«*  +  2s  +  2).  In  this  case  B  is  a  scalar  and  it  is  easily 
verified  that  B  >  0.  The  simulations  in  Figure  5  with  5(0)  =  1  show  that  the  zero  solution 
is  u.a.s.  for  c  =  4  (and  for  all  «  <  4),  but  is  completely  unstable  for  e  =  8  (and  for  all  t  >  8). 
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Figure  5:  Parameter  trajectories  with  varying  gain 


4.2  Recursive  Least  Squares  Algorithm 

In  this  case  we  have  from  (5)  that 

(r  *.)(*)  =  P.(t)z.(t) 

Sp*(0_1  =  z.(t)z,(t)',P.(0)  >  0. 

When  z.tPE  there  exists  a  >  0  such  that 

P.(t)"1  =  P.(o)-1  +  f  z,(r)z.(T)'dr  >  at  - 1. 

Jo 

Thus,  it  is  convenient  to  define  R[t)  =  ip.(t)-1  for  t  >  0.  Hence  R(t)~1  =  tP,(t)  <  £•/  and 
we  can  write  (18)  and  (5)  as, 

0  =  \R~l[ w  -  z.Hev(z[9 )] 
k=\(z.z[-R) 

When  Htv{z)  -  5  is  not  SPR  we  can  now  follow  [2]  and  for  t  >  s  and  a  sufficiently  large, 
approximate  the  right  hand  side  by  its  average.  Letting  “overbar”  denote  average  (assuming 
it  exists)  we  have: 

R(t)  »  \(z7z[  -  R) 

Integrating  from  s  to  s  +  T,  T  >  0,  gives 


1  +  T)  -  0(a)]/  J‘+T  dt/t  «  R~l(w  -  (z.Htvz\)0) 
[K(«  +  T)-  f?(s)]/  J'+T  dt/t  k7~£-R 


Now  change  time  scales  a  +  T  — »  r  4-  Ar,  Ar  =  J*+T  dt/t  and  letting  a  — *  00  gives  the 
differential  equations: 

Oa(t)  =  Ra(t)~1[u>  -  B0a(t)]  (25) 

Ra(t)  =  ^  -  Ra(t)  (26) 

with  B  =  z.Htuz’%  given  by  (20).  These  equations  actually  describe  the  asymptotic  behavior 
of  (18)  in  just  the  same  way  as  they  do  for  discrete-time  (2).  In  order  to  validate  the 
approximations  in  each  of  the  steps  leading  to  (25)  and  (26),  it  is  necessary  to  introduce 
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various  regularity  conditions.  A  complete  proof  can  be  found  in  [2,22].  Here,  as  warned,  we 

ofTer  only  heuristics.  _ 

Observe  that  in  (26)  as  r  — ♦  oo,  Ra(t)  Thus,  when  Htv(a)  -  j  is  not  SPR  and 

z.ePE  with  Fourier  representation  (19)  the  asymptotes  are  stable  if 

Re  X (L)  >  0  (27) 


where  _ 

L-  =(2.2i)-‘fl 

=  (Et  a(wk)a(-wfc),)“1  Ek  <x(vk)a(uiky  Htv(-juk) 


(28) 


If  Re  H(juic)  >  p  >  0  at  low  frequencies,  and  if  jor(a;*)j  is  small  at  frequencies  where 
Re  H(j<jJk)  <  0,  then  Re  X(L)  =  p.  Thus,  all  parameter  asymptotes  have  a  uniform  rate  of 
convergence  which  is  not  the  case  for  the  gradient  algorithm  with  a  time-invariant  gain. 


5.  Averaging:  A  More  General  Approach 


In  this  section  we  will  establish  a  general  form  of  the  adaptive  error  system  (3,5)  which  is 
useful  for  application  of  averaging  methods.  The  first  step  is  to  transform  (3,5)  into  a  set  of 
nonlinear  time-varying  differential  equations.  To  do  this  observe  that  if  Heu(s)  and  HIU (s) 
are  strictly  proper  functions  (a  convenient  illustrative,  but  not  necessary,  assumption)  then 
we  can  write 

Hcv(s)  =  c'(sI-A)-'b 

Hxv{s)  =  D{sl  -  A)~lb  1 

where  A  6  Rnxn,b  €  Zn,D  €  R.pzn,  with  (A, 6,  [c  £)#j)  a  minimal  representation.  Also, 
Re  A  (A)  <  0  reflecting  the  fact  that  Hev  and  Hxv(a)  are  stable.  The  error  system  (3)  is  then 
equivalently  expressed  as 

e  =  e,  -  c'x 


(30) 


2  =  2.  -  Dz 
x  =  Ax  -t-  bz'9 
®  =  (r*)e 

By  eliminating  the  variables  e  and  z  we  can  reduce  (30)  to  the  coupled  state-space  description: 


4>  =  l{t)f{t,<f>,x) 


(31) 


i  =  Ax  +  g(t,<t>,x) 

With  the  gradient  algorithm  (4),  let 


m  =  m 

7(0  =  E 


(32) 


(33) 


f(t,6,x)  =  2,(:)e,(0  -  t)x  +  c'xDx 
Q,{t)  =  z,[t)c '  +  e,{t)D 
g(t,0,x)  =  b{z.(t)  -  Dx)'0 

With  the  recursive  least  squares  algorithm  (5),  define: 


(34) 


where  the  operator  co/{i?}  stacks  up  the  columns  of  the  matrix  R  to  form  a  vector.  Thus, 


(  /?"*(z,(t)e,(t) 

-Q,(t)x  +  c'xDx)  \ 

f{t,<t>,x)  = 

col{z,(t)z,(t)' 

-zt{t)(Dx)'  -  Dxz,(t)'  1 

(37) 

V 

+  Dx(Dx)'  -  R}  ) 

.( t )  -  Dx)'0 

(38) 

The  co/{-}  operator  was  used  by  Ljung  in  [2]  to  develop  the  discrete-time  version  of  (31,32). 


5.1  The  Integral  Manifold 

The  basic  idea  in  the  application  of  averaging  methods  to  (31,32)  is  to  see  what  happens  when 
7(t)  is  small.  Essentially,  <f>[t)  slows  down  and  we  would  expect  to  be  able  to  approximate 
the  right  hand  side  of  (31)  with  its  average,  i.e., 

*  «  (39) 


where 

=  lim  ^  f  f(t,  <f>,x(t,  4>))dt  (40) 

7  —*00  1  J0 

assuming  the  limit  exists.  (Such  is  the  case,  for  example,  when  f(tt<fr,x)  and  x)  are 

periodic  in  t  for  all  bounded  <f>  and  x).  The  function  x(t,<f>)  is  referred  to  as  the  state  of  the 
frozen  parameter  system,  i.e.,  x(t,4>)  solves  (32)  whenever  <f>  is  a  fixed  vector.  To  emphasize 
this  point  we  may  express  x(t,<f> )  as  the  solution  to  the  partial  differential  equation. 

a 

— i  =  Ax  +  g(t,4>,x),x{0,4>)  =  i(0) 


The  frozen  parameter  system  was  introduced  in  the  averaging  analysis  proposed  by  Astrom 

[Hl- 

In  order  to  remove  the  approximation  in  (39)  we  introduce  the  integral  manifold  as 
suggested  by  [13]  [see  [6]  for  discussion  of  the  integral  manifold] 

The  integral  manifold  M  of  (31,32)  is  the  set, 


M  =  {t,<(>,x  :  z(t„)  =  h(t0,<f>(t0))  implies  z(t)  =  h(t,  <(>{t)),  Vt  >  tD)  (41) 


By  substituting  x  =  h[t,<f>)  into  (31,32),  the  manifold  function  is  seen  to  satisfy  the 

partial  differential  equation 

=  Ah  + g{t,4>,h)  (42) 

Whenever  7 (t)  is  sufficiently  small,  a  reasonable  approximation  to  h(t,<j> )  is  given  by  h0(t,<j>) 
which  is  the  solution  to 

=  Ah0  g{t,  <f>,  h0) 

=  F(0)ho  +  G(9)z,{t) 
where  the  last  line  follows  from  (34)  with 


F[0)  =  A-  b9'D,  G{9)  =  b9' 


(44) 


In  (42),  6  and  t  are  regarded  as  independent  variables  and,  hence,  we  can  define  the 
stabilizing  parameter  set 

D,  =  {9  £  :  Re  X(F(0))  <  0)  (45) 

Thus,  for  7(t)  sufficiently  small,  we  can  refer  to  h(t,4>)  with  9(D,  as  the  stable  manifold , 
which  we  will  approximate  by  h0{t,9),9( D,. 


An  important  observation  to  make  at  this  point  is  that  the  approximate  manifold  function 
h0(t,$)  satisfies  the  same  partial  differential  equation  as  the  frozen  parameter  system  state 
x{t,4>)-  The  only  difference  is  in  initial  conditions.  However,  if  9tD ,  then  as  t  — *  oo  we  have 
h0(t,<f>)  ~  *(t,^)  — *  0  exponentially. 

The  final  transformation  on  (31,32)  is  obtained  by  examining  the  behavior  of  (<6,x)  in 
the  neighborhood  of  the  stable  manifold.  Introduce  the  error  state, 

f  (46) 

Using  (46),  and  (31,32),  we  have 

*  =  7(0/M>M)  +  0  (47) 

e=we-7(0M*.*)/M.*M)  +  o  (48) 

where  we  have  used  h<f,(t,  <f>)  to  denote  If  ~j(t)  is  sufficiently  small  it  can  be  shown  that 

under  suitable  regularity  conditions  we  can  approximate  h(t,4)  with  the  frozen  parameter 
state  and  obtain  the  approximate  system, 

£  =  F(0)(  ~  l(t)x*(t,<f>)f(t,<l>,x(t,<t>)  +  0 

Moreover,  if  7(t)  is  sufficiently  small  and  0  remains  (moving  slowly)  in  D,  then  ((t)  —*  0 
exp.  fast.  As  a  result,  by  the  same  reasoning  as  in  Section  4,  the  stability  of  the  asymptotic 
system: 

Mr)  =  7  (Mr))  (49) 

where 

f{4>)  =  l»m  ^  f  f{t,4>,x(t,4>))dt  (50) 

3  — »O0  1  Jo 

assuming  the  limit  exists.  The  stability  of  (49)  is  given  as  follows.  The  proof  is  in  J6].  • 
Theorem 

Let  <j>°  denote  a  solution  of 

7(*°)  =  o 

and  define  the  matrix, 

G  - 

Then,  provided  Re  A (G)  ^  0,  the  equilibrium  solution  4>a(t)  —  <t>°  of  the  asymptotic  system 
(49)  is: 

(i)  u.a.s.  if  maxj  Re  A<(G)  <  0. 

(ii)  unstable  if  max;  Re  A j(G)  >  0. 

5.2  Application  to  Gradient  Algorithm 

Applying  this  result  to  (33,34)  with  the  gradient  algorithm  and  with  z.ePE  and  TZel  —  0, 
gives  G  =  -B  from  (20).  Since  7 (t)  =  t  >  0,  we  can  only  conclude  that  if  Re  A (B)  >  0  and 
e  is  sufficiently  small,  then  0(t)cDt  long  enough  for  transients  to  die  out,  which  is  unprovable 
as  yet  in  general. 

Observe  that  9,  €  kp  such  that  zTeiT  =  0  does  not  define  an  equilibrium  of  the  actual 
system.  All  we  can  say  is  that  with  c  >  0  small,  there  is  a  9(t)  which  orbits  near  (to  order 
c)  the  equilibrium  of  the  asymptotic  system.  We  can  also  choose  to  consider  =  0  as 
a  defining  equation  in  a  candidate  tuned  setting.  Other  conditions  would  also  have  to  hold 
(e.g.  small  e.(t),  etc.)  which  may  be  obtainable  with  proper  input  selection.  In  other  words, 
the  signals  present  during  adaptation  should  be  similar  to  those  present  during  tracking  or 
disturbance  rejection.  Otherwise,  the  algorithms  choice  of  the  tuned  setting  (z7e7  =  0)  may 
be  undesirable. 


5.3  Application  to  Recursive  Least  Squares  Algorithm 

Under  the  same  conditions  and  with  the  same  provisions  as  above,  G  =  -L  with  L  from 
(28).  This  time,  since  7(f)  =  l/(— *Oas£  — >  00,  we  can  conclude  that  if  Re  A(L)  >  0,  then 
B(t)  — *  0  as  I  — ♦  00  at  a  rate  1  /t.  In  this  case,  due  to  the  presence  of  \/t,  the  parameters 
0(t)  asymptotically  approach  the  solution  of  (49). 

6.  Concluding  Remarks 

The  averaging  theory  described  here,  as  well  as  averaging  theory  in  general,  has  its  uses 
and  limitations  for  adaptive  system.  In  the  first  place,  the  theory  requires  slow  adaptation 
which  can  be  counter-productive  because  performance  can  be  below  par  for  the  long  period 
of  time  it  takes  for  the  parameters  to  readjust.  Secondly,  averaging  theory  is  a  form  of 
linearization  so  that  the  (nonlinear)  adaptive  system  must  be  initialized  in  a  (not  necessarily 
small)  neighborhood  of  the  tuned  system.  On  the  positive  side,  however,  we  do  obtain 
frequency  domain  conditions  which  explain  the  system  behavior  near  the  tuned  solutions. 
In  this  sense,  we  can  consider  the  results  of  averaging  theory  to  be  necessary  conditions  for 
good  performance  of  adaptive  systems. 

To  obtain  the  heralded  goal  of  frequency-domain  stability  conditions,  it  may  be  inevitable 
to  encounter  linearization.  Somewhat  less  intuitively  appealing  results  can  be  obtained 
without  resorting  to  direct  linearization  or  averaging,  e.g.,  in  [4,17,21].  These  results  arise 
from  a  combination  of  small  gain  theory  and  perturbation  theory. 
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Abstract 

An  approach  is  presented  to  the  problem  of  on-line  robust 
control  design,  referred  to  here  as  adaptive  calibration.  It  is  an 
iterative  approach  which  modifies  the  filter  and  model  structure 
characteristics  used  in  methods  of  system  identification  involving 
the  filtered  prediction  error.  An  estimate  of  model  uncertainty  is 
obtained  which  is  then  used  to  predict  closed-loop  system 
performance  with  the  new  control  if  it  were  implemented.  If 
predicted  performance  does  not  meet  the  specified  performance 
the  filters  and/or  model  structure  are  modified  to  enhance  model 
accuracy  in  the  frequency  range  required.  An  analysts  is 
presented  along  with  an  illustrative  example. 

1.  Introduction 

Stringent  closed-loop  performance  demands  require  very 
accurate  models  for  controller  design  within  the  system 
bandwidth.  Since  the  actual  dynamics  are  not  likely  to  be 
sufficiently  like  those  obtained  from  testing  methods,  it  is  of 
practical  importance  to  be  able  to  identify  the  system  dynamics 
on-line  and  then  tune  the  controller  to  the  updated  model.  The 
problem  of  on-line  system  identification  and  control  tuning, 
refereed  to  here  as  adaptive  calibration,  is  a  little  like  the  story 
about  Columbus:  “He  didn’t  know  where  he  was  going;  when  he 
arrived  he  thought  he  was  someplace  else;  and  when  he  returned 
he  wasn’t  sure  where  he  had  been.  And  amazingly,  he  did  it  all 
with  borrowed  money!”  The  moral  of  the  story  is  that  in  order 
to  obtain  a  model  from  on-line  identification  which  has  the 
requisite  accuracy,  either  unlike  Columbus,  we  need  to  know 
where  we  are  going,  i.e.,  know  the  answer  a  priori,  or  else  like 
Columbus,  derive  a  means  of  calibration  which  will 
automatically  adjust  the  identification  accuracy,  notwithstanding 
our  ignorance  of  the  true  dynamics. 

Although  we  use  the  term  calibration  to  refer  to  control 
design  based  on  an  identified  model,  we  are  of  course  really 
discussing  an  adaptive  controller.  In  this  case,  although  we  limit 
ourselves  to  infrequent  controller  adjustments,  we  ultimately  face 
the  same  issues  in  the  robustness  of  continuously  changing 
adaptive  controllers,  e.g.,  Anderson  e«  al.  (1986). 

In  this  paper  we  show  how  frequency  domain  bounds  on 
the  unmodeled  dynamics  of  the  identified  model  can  be  extracted 
from  standard  system  identification  procedures.  Such  bounds  are 
required  in  order  to  evaluate  the  performance  robustness  of  the 
control  design,  e.g.,  Doyle  and  Stein  (1981),  Safonov  et  al. 
(1981),  and  Vidyasagar  (1985).  The  basis  for  the  results  here 
can  be  found  in  Ljung  (1985),  (1986)  and  Wahlberg  and  Ljung 
(1986),  involving  the  use  of  parameter  estimation  methods  of 
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system  identification  as  a  means  to  estimate  the  system  transfer 
function. 

The  paper  is  organized  as  follows:  Section  2  discusses  the 
calibration  problem  and  the  issues  in  robust  control  design. 
Section  3  addresses  the  problem  of  determining  model 
uncertainty  bounds  from  system  identification  methods.  An 
example  is  presented  in  Section  4. 

2.  Calibration  and  Robust  Control  Design 

In  this  section  we  discuss  the  general  problem  set-up  for 
adaptive  calibration  in  the  context  of  disturbance  attenuation. 
The  system  to  be  calibrated  is  shown  in  Fig.  1  and  is  described 
in  discrete  time  by* 

y(0  *  G<fg)u(t)  +  v(r)  (2.1) 

“W  =  ~  fftMO  (2.2) 

where  G0(q)  and  F0(q)  are  the  transfer  functions*  of  system 
dynamics  and  feedback  compensation,  respectively.  The  function 
v(r)  represents  the  effective  disturbances  as  seen  at  the  output 
We  assume  that  F(fq)  is  a  stabilizing  controller  for  G(fq),  but 
is  to  be  replaced  with  another  controller  which  is  expected  to 
improve  performance.  The  controller  Fffq)  can  be  thought  of 
as  a  controller  from  a  previous  calibration  or  as  a  “back-up” 
low-authority  stabilizer.  In  a  large  space  structure,  for  example, 
co-located  rate  dampers  are  used  throughout  the  structure  to 
provide  robust  stability,  but  these  being  of  low-authority  provide 
low  performance. 

In  order  to  tune  or  replace  the  controller  F0(q)  in  Fig.  1 
we  need  a  better  model  of  (2.1)  then  what  was  used  to  design 
F0(q)  in  the  first  place.  What  we  mean  by  the  term  model  is: 

(a)  a  nominal  model  of  G0(q)  and  v(r)  in  (2.1) 

(b)  a  set  of  uncertainty  associated  with  the  nominal  model. 
These  together  with 

(c)  a  performance  measure 

constitute  a  robust  control  design  problem  for  disturbance 
attenuation. 


The  variable  q  is  used  to  denote  the  forward  shift  operator,  i.e., 
qx(t)  *  x(it  iy 

*  Strictly  speaking,  C^q)  is  an  operator  whereas  the  complex  func¬ 
tion  £>u(i),  re  C,  is  a  transfer  function. 


Performance  Measure 

Suppose  that  performance  it  specified  in  terms  of  average 
power,  i.e.,  it  is  desired  that 

*  fa  £  A*)  <  yipcc  (2-3a) 

N--  "  ,«  , 


and  in  addition  that 


»vg  {«*(»)}  <  Mqx 


If  0,(0))  and  G>„(to)  are  the  power  spectral  densities  (PSD)  of 
y(r)  and  u(r),  then  (2.3)  is  equivalent  to* 

IK 

-57  /  O/caJdo)  <  (2.4a) 

-* 

K 

■57  <  u|*c  (2.4b) 

Note  that  (1)  is  normalized  frequency  which  is  constrained  to 
the  interval  1  -  x.  x).  Let 

u(r)  =  -  F(q)y{t)  (2.5) 

be  the  feedback  compensation  which  is  to  replace  that  in  (2.2). 
If  Q^co)  is  the  PSD  of  v(r)  then 

«yco)  =  |S(«>“)IJ<P,(«)  (2.6a) 

*„(»)  =  (2.6b) 


(»)  $(q)  |  |1  d(<?)F(q)r'  is  stable  (2.12a) 

(ii)  6(c»  <  bmm  i  1/|(5(«>**)1,  v  oc  l-x.x)  (2.12b) 

(iii)  6(q)  i  Fiq&q)  is  stable  (2.12c) 

The  function  6^,(0)  in  (2.12b)  is  referred  to  here  as  the 
stability  margin.  The  sensitivity  function  (2.6)  can  be  written  as, 

S(q)  *  &«?)|1  +A(qXfa)r'  (2.13) 

This,  together  with  (2.6),  (2.11),  gives  the  following  frequency 
domain  sufficient  condition  for  (2.8)  to  hold: 

<  |  1  _  •«(“)  <  ®5P*e(ta).  *  tte  Hut)  (2.14a) 

®.(o)  <  1 1  •j(«0)  £  •?“(«•).  v  “«  I-*.*)  (2.14b) 

Observe  that  6(0)  <  5„(co)  is  necessary  for  (2.14)  to  be 
satisfied,  i.e.,  (2.14)  implies  stability  robustness.  Condition 
(2.14)  can  be  thought  of  as  a  condition  for  performance 
robustness.  That  value  of  6(0)  for  which  equality  holds  in 
(2.14)  is  referred  to  as  the  performance  margin  and  is  given  by 
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,v  v.vN 
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6  (to)  =  SM(ci>)  (1  -  p(u)) 


s 


(2.15a)  •+ 


1 


S(q)  =  [I  +  GMfFdifr1  •  (2.7a) 

Q(q)  k  Fiq)S{q)  (2.7b) 

For  (2.6a)  to  be  meaningful  it  is  assumed  that  v(r)  has  a  PSD. 
Unless  otherwise  stated,  this  is  a  standing  assumption  whenever 
we  say  that  a  signal  has  a  PSD. 

Suppose  that  there  exist  functions  d^*c(oi)  and  G>y“(<») 
such  that 

%(a>)  <  O^tto)  (2.8) 

0.(0))  <  oy“(0))  (2.9) 

are  sufficient  to  satisfy  (2.4) 

Robust  Control  Design 

If  C(q)  and  «P^to)  are  perfectly  known  then  the 
compensator  F(q)  can  be  designed  to  satisfy  (2.8)  by  any 
number  of  methods.  Realistically,  G(q)  and  O/to)  may  vary 
or  are  not  perfectly  known.  Suppose  we  have  the  set  of 
uncertainty  in  Goto)  given  by 

Goto)  =  <5to)  +  Ato)  (2.10) 

where  6(q)  is  a  known  nominal  value  whereas  the  only 

knowledge  about  Ato)  is  Oat  it  is  stable  and  bounded  by 

iA(e*“)|  <  8 (to)  (2.11) 

In  order  to  provide  closed-loop  stability  for  all  uncertainty  of  the 
form  (2.10).  (2.11)  it  is  necessary  and  sufficient  that  [see  e.g., 
Callier  and  Desoer  (1982)] 


*  The  PSD  of  y(t )  is  defined  as  1 he  two-sided  Fourier  earn  form  of  the 
autocorrelation  function: 

■  * 

X  (t)  ■  lim  rj  £)<'))<('^t) 

Note  that  the  Fourier  transform  of  RJ T)  eaisu  if  R/t)  it  in  f|(R). 


i*pi«r 


(2.15b) 


Hence,  (2.8)  will  hold  if 

6(to)  <  Spm(co)  ,  v  <o  (2.16) 

Observe  also  that  (2.16)  makes  sense  only  if  the  nominal 
closed-loop  system  strictly  satisfies  (2.8),  i.e.,  if 

p(o)  <1  ,  V  roc  [  -x,  x]  (2.17) 

Condition  (2.16)  can  be  used  to  evaluate  candidate 
compensator  designs  F{q).  If  the  candidate  satisfies  (2.16)  then 
the  design  can  be  implemented  with  confidence.  If  not,  then 
(2.16)  provides  information  as  to  the  range  of  frequencies  over 
which  the  design  needs  to  be  modified.  Such  information  can  be 
incorporated  in  the  next  design  iteration,  e.g.,  frequency-shaped 
LQG  as  described  by  Safonov  (1981),  Gupta  (1980),  and  Stein 
and  Athens  (1985).  In  order  to  apply  (2.16),  it  is  necessary  to 
have  a  nominal,  dynamical  model  G(q) ,  the  model  error  bound 
8 (o>) ,  and  the  disturbance  PSD  ®„(«).  It  is  clear  that 
knowledge  of  O^cu)  can  be  relaxed  to  knowing  an  upper  bound 
on  Ov((i>)  in  (2.16).  The  practical  question  to  ask  is  how  do  we 
obtain  this  information?  In  terms  of  on-line  tuning,  can  we  do 
this  automatically? 

3.  System  Identification  and  Model  Uncertainty 

In  this  lection  we  show  how  system  identification  methods 
--  specifically,  parameter  estimation  -  can  be  used  to  provide  a 
nominal  model  6(q) ,  a  model  error  bound  6(o>) ,  and  an 
estimate  of  Ov(to). 

Consider  the  least  squares  parameter  estimator 

6*  -  J»  (6)  (3.1a) 
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where  D  is  a  sublet  of  and  tftfi)  U  the  filtered  error 
signal  dependent  on  6.  Suppose  c /r,8)  has  the  PSD  given  by 
4>t/w.6).  Then, 

0  £  lim  6n  =  are  min/(8)  (3.2a) 

*«  D 

a 

J(0)  =  avg{e?(r,8)}  =  —  J  ®^(M.8)d<i)  (3.2b) 

“  -a 

The  usefulness  of  (3.2)  is  that  if  N  is  large  then  0(N)  is 
approximately  8.  Moreover,  recursive  methods  produce 
estimates  which  asymptotically  approach  6,  e.g.,  Ljung  (1985), 
(1986). 

Parametric  Models 

The  results  and  notation  of  this  section,  which  are  needed  in 
the  sequel,  are  taken  directly  from  Ljung  (1985,  1986).  Assume 
that  Cy(r,  0)  is  generated  from  a  parametrized  observer  or 
predictor  as  follows: 

C/r.  8)  =  Hq)z(t,  8)  (3.3) 

where  L(q)  is  a  stable  filter  operating  on  the  prediction  error 

t(t,  8)  =  y(r)  -  y(t,  8)  (3.4) 

Here  y (r,  8)  is  the  one-step  ahead  prediction  of  y(r)  based  on 
some  parametric  model  for  the  data  set  {y(r),  u(r)}  generated 
from  (2.1),  (2.2),  i.e., 

yif)  «  Gfa)u(t)  +  v(r)  (3.5a) 

•<(0  =  -  Fo(q)y(t)  (3.5b) 

where  v(r)  is  a  zero-average  sequence  with  PSD  given  by 
®„(«).  A  typical  parametric  model  is 

y(r)  =  G{q,  6)u(r)  +  H(q,  6)e(r)  (3.6) 

where  G  and  H  represent  system  dynamics  and  e(f)  is 
“unpredicuble",  e.g.,  a  zero  average  sequence  with  constant 
PSD.  Hence,  we  have 

8)  »  [  1  -  H-\q,  8)M»)  +  H-\q,  B)G(q,  8)u(r)  (3.7a) 

c(r,  8)  =  H~'(q,  8)Lv(f)  -  G(q,  6)u(r)]  (3.7b) 

The  parametrization  of  (3.6)  can  always  be  selected  so  that  the 
map  (u,  y)  -*  y  defined  by  (3.6)  is  stable.  For  example, 
consider  the  scalar  ARMAX  model, 

A(q)XO  =  B(q)u(t)  +  C(q)e(r)  (3.8) 

where  A,B,C  are  polynomials  in  q~]  whose  coefficients  are 

the  elements  of  0.  Hence, 

G(q,  8)  =  B{qVA(q)  (3.9a) 

H(q.  0)  =  C(q)!A{q)  (3.9b) 

then 

>('.  6)  =  [l  -  ]  y[t)  +  g  u(t)  (3.10a) 


(3.10b) 


Observe  that  (u,  y)  -*  y  is  stable  if  1  IC(q)  is  suble,*  which  is 
alv  ays  possible  to  enforce.  Hence,  we  take  the  set  D  cR'  in 
(3.1)  as, 

0  =  { 8*  :  (»,>)  -*  j  in  (3.6)  is  stable}  (3.11) 


Note  that  when  C(q)  ■  1  we  have  the  equation  error  model, 
which  together  with  (3.1)  gives  the  least  squares  solution.  In  this 
case  the  set  0  ■  Note  also  that  filtering  of  the  prediction 
error  is  equivalent  to  pre-filtering  the  data  set,  i.e.,  replace 
{*».  *0).  i  -  I.  Ji)  with  {Uq)M.  Uq)Ht).  t  -  1.  Jt}. 

Frequency  Domain  Interpretation  of  Identification 

The  PSD  of  the  filtered  prediction  error,  denoted  by 
0)  in  (3.2)  is  given  by: 

<fy<o,  8)  =  Ue1^ **,(«.  8)  (3.11a) 

Ot(o>,8)  *  |W-'(e^,  0)|Jl|A(e*  +  Or(w)J  (3.11b) 

where  A to,  6)  is  model  error  [see  (2.10)],  i.e., 

Mg.  8)  =  G(Jiq)  -  C(q.  8)  (3.12) 

and  d>v(o>)  is  the  PSD  of  v(r).  The  above  expressions  are 
obtained  by  combining  (3.6),  (3.7),  and  (2.1),  together  with  the 
following  assumptions  [see  Ljung  (1985,  1986)): 

(Al)  Go(q)  suble  (3.13) 

(A2)  F0to)  =  0  (3.14) 

(A3)  dUw)  =  0  (3.15) 

Because  the  plant  G0to)  is  suble  (Al),  a  stabilizing  feedback  is 
not  required  (A2).  It  is  often  the  case  that  for  practical  reasons, 
a  feedback  is  present  regardless  of  the  plant  subility.  The 
expression  for  8)  in  (3.11b)  is  then, 

•du.  8)  -  ur'tt*.  enl{ia(*>“,  ejitouw)  +  +  2iuia(*^. 

In  this  paper  we  concentrate  on  the  simple  case  when  no 
subilizing  feedback  is  required,  i.e.,  when  assumptions  (Al)- 
(A3)  are  satisfied. 

Suppose  we  have  determined  Be  D  from  (3.2).  We  then 
have  the  transfer  functions  G(q,  0)  and  H(q,  8).  Passing  the 
dau  {y(r),u(/),r  =  1,...}  through  the  predictor  with  0  =  8 
gives  the  prediction  error 

e(r,  6)  =  H-\q,  §)[>(/)  -  Gto,  0)u(r)]  (3.16) 

Hence,  the  PSD  of  e(r,  0)  is, 

«»*(©.  8)  -  Vr'iq.  0>l2[|A(e'“,  0)|2®.(o>)  +  a^to)]  (3.17) 

In  (2.14)  we  computed  upper  bounds  on  ®,((o)  and  ®„(w) 
which  depends  on  knowledge  of  |A(e*\ 0)|  and  ®v(u).  From 
(3.17)  we  see  that  it  is  precisely  these  functions  which  are  the 
“unknowns,”  Le.,  the  function  H(q,  6),  ®t(co,  6),  and  ®,(cu) 
can,  in  principal,  he  computed  asymptotically  from  the  dau  set 
{y(r),  u(r),  t  =  1 as  N  -*  ••.  An  interesting  equivalent 
expression  for  (2.14)  can  be  obtained  by  using  (3.17)  to 

eliminate  C^w).  By  introducing  the  shorthand  notation 

®,(o>)  =  ®,(o>,  0)  ,  d(e*»)  =  G(e*\  0) ,  etc.  (3.18) 

and  by  dropping  the  explicit  o>  and  e**  dependence,  we  have 
from  (3.17)  that 

®,  =  |W|J®t  -  |AP®,  (3.19) 

Hence,  the  inequalities  in  (2.14)  become, 

<t>f  <  TdADli//!2®,  <  <PJP*  (3.20a) 

«*,  <  P(|A|)|dH|*«».  <  ®ip*c  (3.20b) 

where 

Ylb)  t  (1  -  &/6„rJ[l  -  (S/6*,)2]  (3.21) 

8*,  t  |f?l(0»/®.)'°  (3.22) 


•  *  ■  *****  -r  * 


Recall  from  (2.12b)  that 

fiM  £  V\&  (3.23) 

Let  Qg  and  0_  denote  the  complementary  frequency  ranges 
defined  by 

Oo  *  {<*>  «  l  -  K,  n] :  im  >  8*}  (3.24) 

O-  =  {«  «  I  -  K,  *] :  S*.  <  8*}  (3.25) 

Thus. 


y(S«,)  *  +» 

sup  K(8)  =  ' 

*  2  °  n 8pi)  <  - 


v  to  e  n„ 
v  <■)  €  Qo 


V  -  &S-  (3  27.) 

■  7T7CC?  »"» 

Observe  that  Vue  £)_,  it  is  not  possible  to  insure  satisfaction 
of  any  yinr/e  requirement  such  as  <P,(a>)  <  ®Jp“(ti))  <  •». 

Suppose  an  upper  bound  on  to)  is  available. 

Hence, 

d>„  <  «*>?“  •  v  ®  (3.28) 

and  it  follows  from  (3.19)  that 

(329) 

where 

Sto  I  [82,  -  <PrV«>J,a  .  Vucl-s.it]  (3.30) 
Consequently,  we  have 

fy(6J»)  =  +  -  .  vatfl. 


sup  y(6)  *  4  y(8pi) 


v  WE  o. 
v  to  E  (3.31) 
v  <0E  05 


ns  =  {«  e  Oo  :  8^  >  8to}  (3.32a) 

Qo  =  {co  e  Oo  :  h,*  <  8to}  (3.32b) 

A  typical  plot  of  Y(&)  vs  8  for  some  ue  fig  it  shown  in 

Fig.  2. 

The  expression  in  (3.31)  together  with  the  performance 
requirement  of  (3.20)  provide  an  indication  as  to  the 
“goodness”  of  the  identified  model.  Thus,  at  those  frequencies 
where  (3.20)  fails,  i.e.,  for  to  €  where 

Qw  *  {“  *  l  -  *.  K) :  sup  T(8)  >  T)}  (3.33a) 

'  *  <333b) 

it  is  necessary  to  either  abandon  the  specification  or  else  perform 
the  parameter  estimation  (3.2)  under  different  conditions.  For 
example,  some  choices  are  to  modify  the  filter  L(q),  change  the 
parametric  model  order,  or  change  the  parametric  model 
structure.  Exactly  what  the  rules  are  for  such  re-identification 
remains  an  open  question,  but  certainly  will  rest  on  the 
experiment  design  criteria  discussed  in  Ljung  (1985,  1986), 
Wahlbcrg  and  Ljung  (1986),  and  in  Goodwin  and  Payne  (1977). 

For  example,  one  possible  way  to  alter  the  filler  Hq)  is  to 
enforce  the  condition: 


[sup  y(6)/n)l'JiLoWi  ,  to  e  Qm 

ILbuI  ,  otherwise 


Effect  of  Finite  Data  Record 

Calculation  of  the  bounds  in  (3.31)  involve  knowledge  of 
©,(to.  6) ,  «*„(«) ,  G(«'“,  6) ,  etc.  First  of  all,  6  from  (3.2) 
is  the  asymptotic  estimate.  What  is  actually  available  is 
from  (3.1)  for  some  finite  N,  e.g.,  typically  on  the  order  of 
N  =  1024.  Secondly,  for  any  value  of  6  computation  of 
$c(b>,  6)  involves  the  infinite  data  record  {e(r,  6),  r  =  1,2,...}, 
whereas  what  is  realistically  available  is  {e(r,  6),  r  •=  1,2,...,  Nj. 
The  effect  of  finite  data  record  on  transfer  function  estimation 
accuracy  has  been  examined  in  Ljung  (198S,  1986),  where  the 
following  approximation  of  J^(8)  in  (3.1b)  is  established  for 
sufficiently  large  N: 

a 

M*)  =  i  J  UL(e'“)£w(ai.  0)||2  du>  (3.35a) 


£*( C0.  0)  =  tr'd*  0)1^01)  -  G(e>“,  0)^01))  (3.35b) 

with  En,  Yn  and  Un  the  Discrete  Fourier  Transforms  (DFT) 
of  C.  y  ,  and  w  ,  respectively.*  We  use  the  definition  that 

X*((o)  4  DFT{*(r)}  i  -L  £x(r)e->“  (3.36) 

w  m\ 

Observe  that  the  DFT  is  actually  computed  at  discrete 
frequencies,  e.g.,  if  N  is  even  then 
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CQ  = 
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k  =  -  *£.+  1  0- 
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Based  on  the  definition  in  (3.36),  an  estimate  of  the  PSD  of  x(r) 
given  finite  data  {x(r),  t  =  1, ....  A/}  is 

®5W)  =  jf  l**(to)I2  (3.38) 

This  estimate  of  0,(0))  is  not  always  smooth  and  often  looks 
“noisey”.  Smoothing  can  be  accomplished  by  introducing  a 
lag-window  which  is  effectively  a  frequency  domain  correlation 
of  PO/ca)!2  with  a  weighting  (or  window)  function  W(o>)  [see, 
e.g.,  Jenkins  and  Watts  (1968)  and  Ljung  and  Glover  (1981)]. 

A  Calibration  Procedure 

The  following  steps,  depicted  in  Fig.  3,  are  illustrative  of  a 
calibration  procedure  which  follows  more  or  less  naturally  from 
(3.31)  and  the  parameter  estimator  of  (3.2). 

Step  I:  Obtain  from  (3.1)  using  the  model 

structure  (3.3)-(3.7). 

Step  2:  Using  the  identified  system 

G(q,  6N),  H(q,  6N)  design  a  controller 
u(r)  =  -  F(q)y(t)  satisfying  (2.17). 

Step  3:  Using  ,  filter  the  data  set 
{></),  u(t),  r=l, ....  A'}  to  obtain  the  sequence  c(t,  &N) 
defined  by  (3.16). 


*  The  approximations  in  (3.35b)  follows  from  the  fact  that 
If  y<t)  m  T(q)u(t )  then  lir^w)  -  r<»,“)t/*<(D)l|  <  O  with 

C  »  J(  sun  K«(<)ll)(£  *  lift*  ill .  being  the  pulse  response. 
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Step  4:  Using  a  PSD  approximation  estimate 
6*)  and  &m(o>)  from  E(r,  ft*)  and  u(r), 
respectively,  and  then  calculate  sup  Y(8)  vs.  to  from 

(3  31). 

Step  5:  Check  performance  robustness  using  the  test, 
sup  F(8)  <  tj  ,  v  0)  6  |  - 1,  j] 

with  n  given  by  (3.33b).  If  the  inequality  holds  for 
all  u> ,  then  implement  the  controller  from  Step  3. 
Otherwise,  go  to  Step  6. 

Step  6:  At  those  frequencies  where  the  test  in  Step  3 
fails,  either  relax  performance  or  reduce  the  model 
error  by  repealing  Step  1.  This  involves  changing  the 
filter  L(q)  or  increasing  the  model  order  or  both. 

4.  Application  to  a  Laser  Pointing  Experiment 
In  this  section  we  obtain  the  model  error  estimate  S*j((d) 
of  (3.24)  using  data  from  a  laser  pointing  experiment.  Analysis 
of  this  laser  pointing  experiment  can  be  found  in  Walker,  Shah, 
and  Gupta  (1984).  As  described  there,  the  objective  of  the 
experiment  (see  Fig.  4)  is  to  control  the  jitter  of  a  laser  beam. 
The  single  actuator  consists  of  a  proof-mass  which  exerts  a 
reaction  force  on  the  flexible  beam  when  the  proof-mass  is 
moved  by  an  applied  armature  current  input.  A  rate-sensor  is 
provided  on  the  actuator  to  measure  actuator  velocity  (required 
when  very  high  bandwidihs  are  used).  As  the  flexible  beam 
vibrates,  the  laser  beam  changes  its  angular  direction.  A  second 
sensor,  a  quad  detector,  mounted  on  the  structural  support  picks 
up  the  position  of  the  beam,  as  long  as  it  is  in  its  field-of-view. 
The  laser  beam  strikes  a  mirror  on  the  flexible  member  and  then 
is  reflected  back  by  another  mirror  mounted  on  the  proof-mass 
actuator.  The  resulting  beam  is  split  by  a  beam  splitter  into  two 
rays,  one  going  to  the  quad  detector  and  the  other  going  to  a 
screen  where  the  jitter  is  magnified.  The  proof-mass  actuator 
controls  both  the  flexible  beam  vibrations  and  the  optical  path  tilt 
and  hence  can  reduce  the  laser  beam  jitter.  The  mass  of  the 
actuator  is  greater  than  the  flexible  beam  and  therefore  the 
interaction  between  the  modes  of  the  flexible  beam  and  the 
actuator  is  significant. 

Obtaining  the  Model  for  Control  Design 

Jitter  control  was  desired  in  the  region  of  4  Hz  to  20  Hz. 
Consequently,  a  sine  sweep  was  applied  to  the  actuator  lasting 
about  sixteen  seconds,  sweeping  from  4  Hz  to  20  Hz  The 
sampling  rate  was  31.2  Hz.  Figure  3  shows  that  the  magnitude 
of  the  input  signal  between  0  Hz  and  4  Hz  and  between  20  Hz 
and  23.6  Hz  is  very  low  and  therefore  the  model  will  not  be 
accurate  in  those  frequency  ranges.  Figure  6  shows  a 
comparison  with  the  “true"  transfer  function  (the  smooth  dashed 
line)  and  a  transfer  function  estimate  obtained  by  simply  taking 
ratios  of  the  DFT’s  of  the  output  and  the  input,  i.e., 

G*(a>)  *  F*(a»/l/*(a»  (4.1) 

This  latter  estimate,  referred  to  as  the  empirical  transfer  function 
estimate  is  very  noisy  as  expected,  since  it  is  a  modest  reduction 
of  the  raw  data.  What  we  refer  to  here  as  the  "true”  system  is 
an  8**  order  equation  error  model  ((3.8)  with  C(q)  ■  1  J 
obtained  from  the  data  by  solving  the  least  squares  problem 
(3.1). 

Using  the  raw  data  we  now  compute  via  (3.1)  a  4'*  order 
equation  error  model  Its  transfer  function  is  plotted  in  Fig.  7 


along  with  the  empirical  transfer  function  estimate.  Figure  I 
compares  the  8“  order  “true”  system  (dark  line)  with  the  4** 
order  estimate  (dashed  line).  The  model  error  is  obviously 
significant,  particularly  from  about  3  to  9  Hz  where  control  it 
critical. 

We  now  compute  8*,<g>)  from  (3.22).  In  Fig.  9  we  plot 
6fc(u)  in  comparison  with  the  “true"  model  error  between  the 
1“  order  model  and  the  4'*  order  model,  Le., 
| G(e^“,  §t)  -  G(e 6«)|.  Except  for  tome  noise  at  the  low  and 
high  frequencies,  the  estimate  it  very  accurate  where  necessary. 
Therefore,  it  is  possible  to  be  confident  about  implementing  a 
controller  based  on  the  model  set  G(q,  64)  +  a (q), 
|A(«**)|  <  6fc(u).  That  is,  if  the  test  in  Step  3  holds  there  is  lit¬ 
tle  chance  of  failure,  whereas  if  the  test  fails  one  should  proceed 
with  caution  before  implementing  a  new  controller.  In  the  latter 
case  it  would  be  prudent  to  re-identify  will)  filtered  data,  e.g., 
using  the  heuristic  in  (3.34).  The  results  of  such  iterations  are 
not  explored  in  this  paper. 

5.  Concluding  Remarks 

Preliminary  results  of  calculating  frequency-domain  model 
error  bounds  from  identified  models  has  been  presented.  This 
calculation  is  critical  if  identified  models  are  to  be  of  use  in  con¬ 
trol  design.  The  results  show  that  the  proposed  estimates  are  in 
good  agreement  with  the  true  error  in  the  frequency  band  that  is 
critical  for  control  design.  Many  issues  remain,  namely: 

(1)  What  are  the  best  ways  to  smooth  the  model  error  esti¬ 
mates,  e.g.,  “windowed”  PSD; 

(2)  What  are  the  heuristics  for  iterating  on  the 
identification  algorithm  when  the  estimated  model  error 
is  too  large  over  some  frequencies,  e.g.,  increase  model 
order  and/or  tune  the  data  filters  (these  involve  experi¬ 
mental  design  issues  of  input  selection,  etc.); 

(3)  What  is  the  effect  of  structured  uncertainty  on  the  esti¬ 
mate  of  model  error; 
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ABSTRACT 

Methods  are  developed  to  analyze  the  transient  behavior  of  an  adaptive  system. 
It  is  shown  that  both  small  gain  theory  and  the  method  of  averaging  can  be  used  to 
predict  some  of  the  observed  transient  phenomena.  An  important  tool  in  the  analysis 
is  fixed-point  theory,  illustrated  by  the  Contraction  Mapping  Principal,  which  enables  a 
sequential  application  of  linear  analysis  to  the  separated  state  and  parameter  equations 
describing  the  adaptive  system.  It  is  also  demonstrated  that  averaging  theory  applied 
on  finite  time  intervals  can  predict  transient  phenomena  without  requiring  slow 
adaptation. 
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1.  INTRODUCTION 


In  this  paper  we  consider  an  analysis  of  the  transient  response  of  adaptive  control 
systems.  An  understanding  of  the  transient  is  required  in  order  to  satisfy  practical 
requirements  such  as  those  arising  from  constraints  on  tracking  response  and 
disturbance  attenuation.  For  example,  consider  an  adaptive  system  subject  to  abrupt 
set-point  changes,  e.g.,  step  inputs.  Typical  system  requirements  are  stated  in  terms  of 
rise-time,  overshoot,  undershoot,  and  settling-time.  Unlike  a  non-adaptive  system,  two 
sets  of  such  requirements  are  needed;  one  set  determined  by  the  goal  of  the  adaptive 
system,  i.e.,  when  the  adaptive  parameters  are  near  convergences,  and  another  set  of 
requirements  dealing  with  the  transient,  i.e.,  when  the  adaptive  system  is  learning.  The 
latter  requirements  include  reasonable  length  of  time  for  learning  as  well  as  bounds  on 
responses  imposed  by  hardware  limitations. 

Analysis  of  the  adaptive  systems  transient  will  require  sharper  estimates  of  signal 
bounds  and  rate  of  convergence  than  currently  exist.  Consider  the  ideal  case  of  perfect 
model  matching,  i.e.,  when  there  exists  a  constant  unique  setting  of  the  adaptive 
parameters  which  produce  zero  error  for  all  inputs.  In  this  situation  although  it  is 
possible  to  prove  global  stability  and  exponential  parameter  convergence,  the  system 
states  can  be  arbitrarily  large  and  the  theory  does  not  offer  guidelines  for  adjustment, 
e.g.,  Goodwin  and  Sin  (1984).  Local  stability  analysis  based  on  the  method  of 
averaging  —  which  is  valid  also  in  the  non-ideal  case  —  provides  some  transient 
information  but  is  restricted  to  parameter  trajectories  which  vary  slowly  in  a  convex 
subset  of  the  constant-parameter  stability  set,  see  e.g.,  Astrom  (1983,  1984),  Bodson  et 
al.  (1985),  Riedle  and  Kokotovic  (1986),  and  Anderson  et  al.  (1986).  Certainly  one 
can  argue  that  the  latter  is  not  restrictive  in  practical  system  tuning  when  the  plant  is 
slowly  varying  and  initial  parametrizations  are  close  to  a  tuned  setting.  The  drawback 
is  that  although  the  convergence  rate  is  exponential,  it  is  also  very  slow,  whereas 
simulations  show  that  onset  of  instability  may  produce  very  rapid  learning,  see  e.g., 
Anderson  (1983).  Moreover,  estimates  of  the  convergence  rate  and  the  region  of 
attraction  obtained  from  the  method  of  averaging  can  be  quite  conservative,  and  hence, 
do  not  provide  a  complete  representation  of  the  achievable  transients  leading  to  good 
performance,  e.g.,  Marcels  (1986). 


In  this  paper  we  present  an  approach  to  the  analysis  of  the  transient  based 
primarily  on  the  use  of  fixed-point  theorems,  e.g.,  Kosut  and  Bitmead  (1986).  Here 
we  discuss  how  the  Contraction  Mapping  Principal  can  be  used  in  conjunction  with 
other  methods  of  analysis  including  small  gain,  passivity,  and  averaging.  Simulations 
will  be  provided  in  the  final  version  of  the  paper  which  illustrate  the  use  and 
limitations  of  the  theory.  The  theory  presented  here  is  limited  to  a  simple  continuous¬ 
time  gradient  algorithm.  Extensions  as  well  as  discrete  time  algorithms  will  be 
presented  in  the  final  version. 
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2.  ADAPTIVE  ERROR  SYSTEM 

A  genera]  structure  for  an  adaptive  system  is  shown  in  Fig.  1  and  is  described  by 
the  operator  equations 

f  e  1  f  W  1  f  ?ew  Peu  1  f  W  1 


u  =  -  F(0)<}>  (2.2) 

6  =  f2(0o,  e,  4>)  ,  0(ro)  =  0O  (2.3) 

The  adaptive  system  consists  of  three  subsystems: 

(1)  the  plant  subsystem  P  which  takes  exogenous  inputs  w  —  consisting  of 
references  and  disturbances  ~  and  the  adaptive  control  inputs  u  into  the 
error  e  and  regressor  <f>; 

(2)  the  control  subsystem  which  transforms  <)>  into  u  via  the  control  matrix 
F(0),  which  is  parametrized  by  0  the  adaptive  parameter  vector;  and 

(3)  the  adaptation  subsystem  Q  which  uses  the  error  signal  e,  the  regressor 
signal  <)>,  and  the  initial  parameter  value  0O  to  generate  the  parameter  0. 


"F>v  1 . 


5 


The  structure  of  (2.1)-(2.3)  can  describe  most  of  the  standard  forms  of  either 
continuous-time,  discrete-time,  or  hybrid  adaptive  controller.  Details  on  this  structure 
can  be  found  elsewhere,  e.g.,  Kosut  and  Johnson  (1983),  Anderson  et  al.  (1986).  For 
example,  a  typical  form  for  (2.2)  is  the  bilinear  structure 

u  =  -  6T4>  (2.4) 

Typical  forms  for  (2.3)  include  the  simple  continuous  time  gradient  algorithm 

0  =  etye  (2.5) 

or  the  discrete-time  normalized  gradient  algorithm 

50  =  e  — (2.6) 

i  +  m  4  r 

where  5  is  the  difference  operator,  i.e.,  (5 0)(O  =  0(0  -  0(f-l).  In  (2.5),  (2.6),  e 
and  p  are  positive  constants  and  |  *  |  is  the  Euclidean  norm,  i.e.,  |  <j>  |  =  (<pT<p)1'2. 

For  illustrative  purposes  we  will  concentrate  first  on  the  continuous-time  gradient 
algorithm  (2.5).  A  convenient  form  for  analysis  is  the  adaptive  error  system  which  is 
formed  by  introducing  the  parameter  error 

8(0  =  0(0  -0*  (2.7) 

We  refer  to  0*  as  a  tuned  parameter,  which  is  a  constant  vector  of  parameters 
producing  desirable  performance  properties  of  (2.1),  (2.4).  When  0(0  is  held  fixed  at 
0*  the  resulting  system  is  referred  to  as  the  tuned  system  and  is  described  by 


«.  =  -  F(0*)<(>*  =  -05<J>.  (2.8) 

The  signals  e*,  <>•,  and  u*  are  referred  to  as  the  tuned  error,  regressor,  and  control, 
respectively.  If  P  is  a  linear-time-invariant  (LTI)  operator  with  transfer  matrix  P(s), 
then  it  can  be  shown  (Kosut  and  Friedlander,  1985),  that  the  adaptive  system  (2.1), 
(2.4),  and  (2.5)  can  be  described  in  error  form  by 

0  =  E<j>e  ,  0  (fg)  =  0o 

e  =  e.-H^W)  (2.9) 


sW-  V'  <*  •%  v,  va  .v: 


•\  ,  -V -• A -W 


where  H#,,  H$v  are  stable  LTI  operators,  dependent  on  6*,  with  transfer  functions 
Hqv(s),  respectively.  Stability  of  H^,  H^v  follows  from  the  definition  of  the 
tuned  parameter  setting. 

The  system  (2.9)  can  be  shown  to  be  globally  stable,  i.e.,  stable  for  all  0^  e  IRP, 
provided  that  H^is)  is  SPR  (strictly  positive  real),  the  tuned  error  e*(r)  is  zero,  and 
is  bounded.  Zero  tuned  error  can  be  relaxed  to  e*(t)  bounded  and  decaying 
exponentially  fast  to  zero.  Moreover,  if  <u(r)  is  persistently  exciting  then  (2.9)  is 
globally  exponentially  stable,  i.e.,  stable  for  all  8^  e  1RP.  These  results  are  typical, 
but  not  particularly  useful  for  a  transient  analysis.  In  the  first  place  the  signal  bounds 


are  crude.  Suppose  for  example  that 

He,  =  1  (2.10) 

which  is  certainly  SPR.  Suppose  also  that  3  constants  K  >  0  and  a  >  0  such  that 

t 

I  tfVXO  I  <  j  Ke-W |  V(T)  | dz  (2.11) 

o 

We  then  have  the  following  bounds: 

(a)  II©  IL  <  I  5b  I  (2.12a) 

(b)  ||4>t8||2  <  |8bl/(2e)1/2  (2.12b) 

(c)  <  W*)\  Sb  l/(2e)1/2  (2.12c) 

(d)  ||4>  -  <ML  <  [Kl(2a)if2]\  6b  \Wm  (2.12d) 

If,  in  addition,  <|>*(f)  is  persistently  exciting,  i.e.,  3  Tq  >  0,  Oq  >  0  such  that 

S+Tq 

Kin{  j-o  /  >  <*<,  ,  v  s  >0  (2.13) 

then  6  (r)  -»  0  and  <K0  -  4>*(r)  — >  0  exponentially  with  rate  of  convergence  no 
slower  than 


where 


2tToOo 

(1+eToPo)1 


(2.15a) 


n  =  ■ 


Po  =  ll<Mll  (2.15b) 

Hence,  for  small  e  we  have 

X  -» cocq  ,  as  e  -» 0  (2.16) 


Observe  also  that  for  large  e  the  convergence  rate  decreases,  i.e.. 


X 


op 

^(PqT'o)2 


as  e  — »  *o 


Thus  there  is  a  limit  to  the  convergence  rate  as  seen  by  (2.16),  (2.17). 


(2.17) 


The  weakness  of  the  bounds  in  (2.12)  and  the  convergence  rate  estimate  (2.14)  is 
that  they  are  conservative.  It  is  often  prudent,  for  example,  to  choose  a  small  e.  By 
(2.12)  we  then  have  boundedness,  but  very  large  values  can  accrue  since 
||<t>  -  <ML  “  (e)~1/2  ,  likewise  the  convergence  rate  estimate  is  very  low.  Simulations 
have  verified  this  behavior  in  certain  cases. 

One  may  well  ask  the  question:  are  these  results  intrinsic  to  the  adaptive  system 
or  merely  a  result  of  the  stability  theory?  The  answer  may  ultimately  turn  out  to  be 
both,  but  for  now  we  concentrate  on  the  limitations  of  theory.  It  is  obvious  that  when 
Hgv(s)  is  not  SPR  the  global  theory  breaks  down  completely.  We  turn  then  to  local 
theory,  e.g.,  the  method  of  averaging.  For  example,  from  Anderson  et  al.  (1986)  it  is 
shown  that  if 


T 


Re  hmj;  }  >  0  (2.18) 

and  Gfo  is  in  a  convex  subset  of  the  constant  parameter  stability  set,  then  for  all  small 
e  >  0, 


lim  sup  |  B  (r)  |  =  [1  +  0(e)]0(||e*IL)  (2.19) 

f 

Moreover,  6  (r)  approaches  the  above  limit  set  exponentially  fast  with  a  convergence 
rate  of  0(e).  Thus,  local  stability  is  insured.  We  also  have  local  instability  insured 


if  any  eigenvalue  of  the  matrix  in  (2.18)  has  a  negative  real  part. 

The  results  in  (2.18)  and  (2.19)  are  intuitively  pleasing  since  they  are  robust,  e.g., 
if  R cH^fJca)  >0  at  the  dominant  frequencies  of  $*(0  then  (2.15)  will  hold.  Thus, 
can  be  “almost”  SPR,  at  least  where  it  counts.  Also,  the  instability  result  keeps 
us  from  proposing  kakimaymy  excitation  signals.  The  drawback,  however,  is  that  the 
bounds  required  on  e  are  conservative,  and  simulation  results  allow  for  much  larger 
values,  see  e.g.,  Mareels  et  al.  (1986).  Secondly,  if  is  SPR  then  we  know  from 
the  global  theory  that  e  need  only  be  positive,  yet  the  averaging  result  requires  e 
small.  In  the  next  few  sections  we  hope  to  shed  some  light  on  these  issues. 

3.  LINEAR  ANALYSIS 

We  now  consider  only  the  prototypical  ideal  adaptive  error  system 

e  =  -  £4>//(<j>tG)  ,  0(ro)  =  e0  (3.1) 

$  =  $*  -  G(<frT6)  (3.2) 

For  convenience  we  have  dropped  all  exogenous  notation.  We  assume  that  the 
operators  H  and  G  are  linear  integral  operators  which  are  exponentially  stable,  i.e., 
as  in  (2.11)  we  have  for  some  positive  constants  K  and  a  that 

i 

|  {Gu){t)  |  or  |  {Hu){t)  I  <  I  u{ t)  \dx  (3.3) 

o 

Although  system  (3.1)-(3.3)  docs  not  at  all  represent  all  the  myriad  variants  of 
adaptation  mechanisms,  we  assert  that  any  scheme  for  obtaining  a  transient  analysis  of 
this  system  will  apply  to  the  variants  as  well.  Before  describing  some  approaches,  we 
begin  with  some  examples  which  illustrate  transient  phenomena. 

We  start  by  examining  (3.1)  and  (3.2)  separately,  as  if  they  were  decoupled  linear 
systems.  Consider  first  (3.1),  and  assume  that  <J >(r)  is  a  given  function  of  time.  If  we 
take  the  scalar  case  with  H  -  1,  (3.1)  becomes, 

e  =  -e<»2(oe  ,  6(0)  =  e0 

Assuming  $(0  is  our  “input”  the  exact  solution  of  (3.4)  is 


(3-4) 


9 


I 

0(0  =  0O  exp  {  -  e  |  $\z)dx  } 


It  is  clear  that  |  6(0  I  <  I  6o  I  as  stated  in  (2.12a).  Suppose  6(0  is  given  by 

6(0  =  ae~^  +  sin  ct  (3.6) 

where  b  and  c  are  positive  constants.  Suppose  that  for  t  e  {0,  fj],  ae'1*  ~  a  »  1. 
Then 


We  then  have 


<j>(0  =  a  ,  t  e  [0,  t,] 


0(0  =  0O  exp  {  -  tah  }  ,  t  e  [0,  /j) 


As  time  goes  on  6(0  — >  sin  ct  and  hence 


0(0  ->  60  exp  {  -  t  -  sin  2 ct} 


Thus,  6(r)  — »  0  exponentially  as  expected,  since  (3.6)  is  (asymptotically)  persistently 
exciting  for  (3.4).  Observe,  however,  that  the  early  convergence  rate  from  (3.8)  is  Co2 
which  may  be  considerably  larger  than  the  final  convergence  rate  of  c/2  from  (3.9). 
It  is  precisely  this  kind  of  transient  phenomena  which  needs  to  be  addressed. 

Now  consider  (3.2)  separately  and  suppose  that 


G(s)  =  — 
s+* 


,  a  >  0 


and  that 


System  (3.2)  becomes 


0(0  =  -  re""  ,  a  >  X  >  0 


6-6*  =  -x 


x  =  -  (a  +  0(0)*  +  0(O6*tt  ,  *(0)  =  0 


(3.10) 


(3.11) 


(3.12) 


The  solution  x(t)  is  then  explicitly  given  by: 


F(r,  t)0(t)4>.(t )dx 


F(t,  t)  =  exp  {  -  j  (o  +  0(j))<fc} 


(3.13) 
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It  is  clear  that  x(t)  ->  0  exponentially.  However,  x(t)  may  obtain  quite  a  large  peak 
value,  particularly  if  X  «  a.  Recall  that  X  =  0(e)  as  seen  from  (2.15)-(2.17).  In 
fact  when  X  «  a  the  initial  behavior  of  (3.12)  is  like 


x  =  -  (a  -  r)x  -  r$*(t) 


(3.14) 


If  a  -  r  <  0,  or  even  near  zero,  then  x(t)  will  initially  grow  until  a  -  re~^ 
becomes  negative.  In  fact  the  time  to  reach  the  peak  in  x (/)  is  approximately 


tpk  ~  T  Wa) 


(3.15) 


Hence,  the  “more”  initial  instability  (r  >  a)  the  longer  it  takes  to  turn  the  system 
around  and  the  bigger  will  be  the  value  of  |  x(tpk)  |.  [The  full  paper  will  contain 
simulations  of  those  phenomena.] 

We  now  discuss  analysis  techniques  which  can  predict  the  demonstrated  transient 
behavior  for  (3.1)  and  (3.2)  separately.  In  the  next  section  we  will  see  that  these 
separate  linear  analysis  can  be  joined  by  applying  fixed  point  theory. 

For  systems  (3.1)  consider  the  interval  [r0,  r0  +  7],  Define  the  sample  average 


matrix 


We  then  have  that 


where 


ta*T 

/?  =  1  J  4>(r)4 >\t)dt 


er(/0  +  T)Q(t^T)  <  er(r0)[/  -  Ti/?]0(ro) 


2 eT 

(l+eP7)2 


(3.16) 


(3.17) 


(3.18) 


P  =  sup 


(3.19) 


The  proof  of  (3.17)-(3.19)  is  omitted  here  but  follows  directly  by  using  the  Lyapunov 
function  V(r)  =  |  8(r)  |2  and  then  differentiating  along  (3.1).  Observe  that  for  all 
e  >  0,  we  have  ti  e  (0,  1)  and  also  that 


which  verifies  (2.12a).  It  is  usually  the  case  that  R  is  at  worst  rank  deficient,  i.e., 
1  <  rank(R)  <  dim(9),  in  which  case  it  can  be  shown  a  submatrix  of  /  -  T[R  of 
order  rank(R)  is  contractive,  i.e.,  a  linear  combination  of  6  parameters  is  contracting 
over  [r0,  r0  +  T).  A  similar  result  can  be  obtained  for  the  system 

6  =  -  e<J)//(<t)re)  (3.21) 

where  H  is  not  necessarily  SPR  but  can  be  expressed  as  H  =  H  +  H  where  H  is 
SPR  and  H  is  “small".  The  result  is  analogous  to  (3.17H3.20)  but  with  the 
exception  that  t\\H\\  is  required  to  be  sufficiently  small  and  R  is  given  by 

k&T 

R  =  ■£:  j  (//  <»(//  ®Tdt  (3.22) 

<6 

An  earlier  version  of  this  result  is  in  Anderson  et  al.  (1984). 

Now  consider  system  (3.2)  as  if  8(f)  had  given  properties.  It  is  convenient  to 
define  the  regressor  error 

KO  =  <K0-<M t)  (3.23) 

We  then  have  (3.2)  written  as 

^  =  -  G(8r(<)>*  +  $))  (3.24) 

Hence, 

♦  =  -  C(eT<M  (3.25) 

where  Q  is  the  linear  integral  operator 

Q  =  (Z  +  GOVC  (3.26) 

whose  kcmal  satisfies 

i 

Q{t,%)  =  G(f,  X)  -  jG(f,  s)QT(s)Q{s,  x)ds  (3.27) 

t 


It  follows  from  (3.3)  that  if 


which  may  be  obtained  from  (3.3)  and  (3.28)  by  applying  the  Bellman-Gronwall 
Lemma.  If  a  >  >X  then  the  peak  value  of  |  <f</)  |  could  be  as  large  as,  but  no 
larger  than, 

IlflL  <  m* IL  7  Jrtk  (3.30) 

For  small  X  the  term  can  be  quite  big  unless  of  course  r  is  small,  i.c.,  0(0 
is  small  to  begin  with.  From  the  previous  example  we  saw  from  (3.8),  (3.9)  that  even 
though  X  may  be  of  order  e  ultimately  (3.9),  it  is  possible  during  certain  transients 
for  X  to  be  large,  e.g.  (3.8)  Further  examples  will  be  presented  in  the  final  paper. 

4.  NONLINEAR  ANALYSIS 
4.1  Fixed  Point  Theory 

In  this  section  we  show  how  the  linear  analyses  of  Section  3,  separately  applied 
to  (3.1)  and  (3.2),  can  be  brought  together.  This  is  accomplished  by  application  of  the 
Banach  Fixed  Point  Theorem  (FPT),  Le.,  the  Contraction  Mapping  Principal  (CMP). 
We  need  the  following  definitions. 

If  Af  is  a  subset  of  a  Banach  space  B  with  norm  ||||,  and  T  is  an  operator 
mapping  Af  -*  B,  then  T  is  a  contraction  on  M  if  3  constant  o  e  [0,  1)  such 
that 

IIFx-Iyll  <  oHx-yl|  ,  V  xy  €  Af  (4.1) 

The  constant  o  is  the  contraction  constant  for  T  on  Af.  A  fixed,  point  of 
r  :  Af  — »  Af  is  a  point  (function)  x  e  Af  such  that  x  =  Tx.  We  now  have  the 
following  theorem  as  stated  in  Hale  (1969). 

Contraction  Mapping  Principal  (CMP):  If  M  is  a  closed 
subset  of  a  Banach  space  B  and  P.  M  -*  M  is  a  contraction 
on  Af,  then  T  has  a  unique  fixed  point  in  Af. 


n 

♦ 
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.v 


In  order  to  apply  the  CMP  to  the  adaptive  system  (3.1),  (3.2)  we  need  to  identify 
the  operator  T  and  the  space  M.  For  example,  consider  the  operator  T  defined  by 


<$>  =  <j>*  —  GCG7  <>) 


n  e  -» e : 


e  =  -  e4>//(4»re)  ,  e(0)  =  0O 


(4.2a) 

(4.2b) 


It  is  clear  that  fixed  points  of  T  are  solutions  to  the  adaptive  system,  i.e.,  0  =  T0  is 
equivalent  to  (3.1)-(3.3).  In  fact,  the  solution  to  (3.1)-(3.3)  is  unique  and  is  also  a 
fixed  point  of  I".  A  convenient  choice  for  the  space  M  is 

W  =  {0e  C[  0,  7]:  11011m  <  r}  (4.3) 

where  we  choose  the  norm  on  M  as 

(4.4) 


,ie,W  =  f  w*  I  ehQ(t) 


with  X  and  r  chosen  positive  constants.  We  proceed  to  apply  the  CMP  as  follows. 
First,  we  establish  that  T  maps  A#  -» M.  Hence,  pick  any  0  €  M ,  i.e., 

|  0(r)  |  <  re~h  ,  and  solve  (4.2a).  With  0(0  so  chosen  (4.2a)  is  a  linear  system  and 
the  analysis  in  Section  3  applies.  In  particular,  (3.29)  holds.  We  proceed  to  (4.2b) 
which  is  again  linear  since  $  (0  is  obtained  from  (4.2a)  and  is  not  dependent  on 
0(0-  Thus,  the  results  of  (3.16)-(3.20)  apply.  We  then  enforce  ||0||m  <  r ,  which 
restricts  the  relation  among  0O  ,  K,  a,  X,  and  r  so  that  T  =  M  -*  M.  In  this  case 
we  also  obtain  that  T  is  contractive  on  M  under  the  same  conditions  causing 
T:  M  M.  These  conditions  will  be  stated  in  the  full  version  of  the  paper. 

The  choice  of  M  in  (4.3)  leaves  too  much  imprecision  in  0(0-  A  “finer” 
choice  is  to  select  the  norm  on  M  to  be, 


||0JIm  =  sup  |e*0(O 

» 6  to.  71 


(4.5) 


where  B  €  with  Re  X(B)  >  0.  This  allows  for  greater  possibilities  in  the 

transient  of  0(f). 
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Other  choices  of  r  are  also  investigated.  For  example,  consider 

» 

<j>  =  4>*  +  y 

T:  \jr  -»  y  :  '  0  -  -  e#7©  ,  6(0)  =  0O  (4.6) 

V  =  -Off  6) 

Again  solutions  of  (3.1  )-(3.3)  are  fixed  points  of  T  and  vice  versa.  In  (4.6)  T  maps 
regressor  error,  i.e.,  deviations  about  <>*.  Application  of  the  CMP  to  T  in  (4.6) 
yields  conditions  on  transient  behavior.  These  are  verified  by  simulations. 

4.2  Method  of  Averaging 

One  further  possibility  to  predict  transient  behavior  is  to  utilize  the  method  of 
averaging  over  finite  time.  This  analysis  is  not  then  dealing  with  stability.  In  this 
analysis  we  can  return  to  the  system  of  (2.1),  (2.4),  (2.S)  which  in  state  form  is 
described  by 

0  =  ,  6(0)  =  60e  TRP 


e  -  c1. x 


,  4>  =  Dx 


x  =  A(0)x  +  U(0)H>(r)  ,  x(0)  =  xq  e  JRn 

(Recall  that  6(r)  is  the  actual  adaptive  parameter,  not  the  deviation  from  6*).  To 
apply  averaging  requires  a  time-scale  decomposition  (see,  e.g.,  Riedle  and  Kokotovic 
(1986),  Anderson  et  al.,  (1986)).  This  is  accomplished  by  introducing  the  frozen 
system  state  iff,  0),  i.e.,  for  each  fixed  0  €  IR*  , 

r 

xlf,  0)  =  «M(e)x0  +  J«(,^(0)B(0)w(t)dx  (4.8) 

o 


and  the  state  error 


z(0  =  x(t)  -  x(t,  0(0) 


The  result  is  that  for  sufficiently  small  time  intervals  -  not  necessarily  small  e  --  the 
parameter  transient  behaves  like  the  solution  to  the  linear  system 

±  (e  -  e0)  =  tw,  e0)cK  e0)  -  *«>  eo)(0  -  M  (4.io> 

where  $(r,  0O)  and  e(t,  0O)  are  the  frozen  system  regressor  and  error,  respectively, 
and  R(t,  0O)  is  given  by 

*(r,0o)  =  (4.11) 

In  (4.11),  Hev  and  H^v  have  their  usual  definitions,  vis  a  vis  (2.9),  except  now  are 
dependent  on  0O  rather  than  0*.  Averaging  analysis  proceeds  from  here  in  the  usual 
way  and  will  be  documented  in  the  full  version  of  the  paper. 

5.  CONCLUSIONS 

In  this  investigation  of  the  transient  properties  of  adaptive  control  systems  we 
have  shown  that  some  of  the  interesting  phenomena  can  be  analyzed.  The  tools  for 
analysis  involve  a  combination  of  small  gain  theory,  passivity,  and  the  method  of 
averaging  with  these  all  linked  together  by  the  Contraction  Mapping  Principal. 
Although  each  of  these  tools,  in  principal,  involves  straightforward  calculations,  it  is 
clear  that  the  level  of  complexity  of  a  realistic  adaptive  system  is  well  beyond  hand 
calculation.  Hence,  an  area  for  further  work  is  in  the  development  of  software  tools 
which  can  eliminate  some  of  the  tedious  parts  of  the  analysis. 
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A  from  tuned  signals  e« ,  +a,  e« 


ABSTRACT 

Tne  use  of  fixed  point  theorems  is  considered  for 
the  stability  analysis  of  adaptive  systems.  The 
particular  fixed  point  theorems  considered  are  the 
Contraction  Happing  Principle  of  Banach  and  the 
Schauder  Fixed  Point  Theorem.  It  is  shown  how  the 
contraction  property  can  be  achieved  by  exponential 
stability  of  the  homogeneous  part  or  the  linearized 
adaptive  system.  The  region  -of  linearization 
validity  is  estimated  by  considering  fixed  as  well 
as  adaptive  tuned  systems.  Fixed-point  theorems 
are  shown  also  to  be  useful  for  a  transient 
analysis  of  the  adaptive  system. 


^ .  INTRODUCTION 

A  general  structure  for  an  adaptive  system  is 
shown  in  fig.l  and  Is  described  by  tne  operator 
equations 


H(e)w  - 


H,u(e) I 


K,w<«)| 


0.1*) 


6  •  8(e,0) ,  8(0)  -  8. 


(1.1b) 


The  adaptive  system  shown  consists  of  three  basic 
subsystems: 

*1.  The  adaptation  subsystem  0  which  uses  the  error 
signal  e,  the  regressor  signal  8,  and  the 
lntltlal  parameter  value  8.  to  generate  the 
parameter  8, 


2.  The  error  subsystem  H.„(8)  which 


Is 

the 


3. 


parametrically  dependent  on  8  and  takes 
exogenous  inputs  w  -  consisting  of  reference, 
disturbance  and  noises  -  and  produces  the  error 
signal  s  used  for  adaptive  parameter 
adjustment. 

The  regressor  subsystem  H,w(8),  also  dependent 
on  8,  maps  w  into  the  regressor  signal  ,.  The 
regressor  is  derived  from  measured  signals  end 
usually  la  constructad  so  as  to  rapraaent  the 
states  of  the  system  model. 

The  decomposition  of  the  adaptive  system  shown  In 
(1.1)  la  non-standard  In  that  the  plant  and/or 
controller  la  not  explicitly  visible.  This 
structure  Is  chosen  to  highlight  signal  properties 
and  general  oparator  characteristics  rather  than 
plant/controller  structures  and  perametrlzallons. 
8  similar  decomposition  can  be  found  In  Kosut  end 
Anderson  (I960)  and  Anderson  et  al.  (1986). 

In  this  paper  we  will  show  how  the  Banach  Flxtd 
Point  Theorem  (the  Contraction  Mapping  Principle) 
can  be  applied  to  study  the  local  stability 
properties  of  (1.1).  To  apply  the  theorem  we 
utilize  tne  techniques  of  linearization  about  a 
tunad  system  trajectory.  In  this  paper  the 
constant  parameter  tuned  system  concept  Introduced 
In  Kosut  and  Frladlandar  (1986)  la  extended  to  th* 
mors  general  oast  where  the  tunad  system  oan  b*  an 
Ideal  adaptive  system. 


2.  THE  TUNCU  S1STEH  AND  LINEARIZATION 

In  this  section  we  develop  tn  Incremental 
version  of  (1.1)  which  couples  the  deviations  of  e. 


i)'i  • 


does  not 


Ov'py  ow*‘ 

j*  f  v  /  reproduction 
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8.  8  from  tuned  signals  e. ,  f ■ ,  Sa.  He  first 
develop  the  Incremental  form  and  then  discuss  the 
meaning  of  the  tuned  signals.  For  brevity  we 
Introduce  the  notation 


e 

ei 

c 

X  • 

8 

0  *•  - 

0  X  • 

l 

8 

6* 

A 

(2.1) 


let  Q(x )  denote  the  operator 


Q(x)  - 


eu(8)w 
,u(8)w 
0(8. e) 


(2.2) 


Thus,  (1.1)  is  equivalent  to  the  single  expression 


Q(x) 


(2.3) 


Using  (2.1)  with  (2.3)  gives 


Q(xe  ♦  X)  -  xa 


(2.9) 


which  may  also  be  written  as 
5  •  t  »  F(i) 


(2.5) 


where 


Q(xa)  -  xa 


«e 

Hew(0a)w  -  ei 

68 

- 

H,w(0e)w  -  0a 

6t 

Q(ca,4a)w  -  0a 

(2.6) 


and 


F(x)  ■  Q(xa*x)  -  Q(x. ) 


H.w(  0a*8)w  -  He„(8.)w 
H,w(B**®)w  —  H,w( 0. ) w 
Q(ee»e,0a»8>  -  O(e.,0.) 


(2.7) 


The  abstract  nonlinear  operator  representation 
(2.3)  Is  completely  equivalent  to  the  adaptive 
aystam  (1.1).  System  (2.3)  Is  referred  to  as  the 
error  system  version  of  (1.1)  corresponding  to 
tuned  aignals  e*,0a,8a.  One  of  the  reasons  for 
working  with  the  error  system  rather  than  the 
original  eystem  is  that  robust  stability  of  the 
original  system  Is  more  easily  expressed  with 
respect  to  the  system  behaviour  ralatlve  to  the 
ideal  (tuned)  behaviour.  He  will  return  to  this 
point  later  in  the  peper. 

He  now  eeteblleh  e  conceptual  framework  for 
linearization.  Suppoae  that  F(z)  la  locally 
Frechel  differentiable  with  reaped  to  x  for  all  x 
In  some  neighborhood  of  x  •  0,  e.g.  ||x||  t  r  where 

H*  1 1  is  e  nora  on  a  Btntoh  apace  B.  Then,  for  all 
x||  Sr,  we  Bay  act  as  if  there  were  a  linear 
operator  l  such  that 


Mil  •  *5^  (ih 


x  •  0 


>V- 


#? 


0 

0 

(WW  »»>,,“ 

e 

0 

0 

(2H,W/Se)6iw 

i 

Ofl/2e)  , 

e« ,  +• 

e«,4* 

0 

l 

Hence,  (2.5)  is  equivalent  to 
x  -  6  ♦  L(x)  ♦  Mx) 


A(x)  -  F(x)  -  Ux) 


Assuming  for  the  moment  that  H(ft)w  and  sKe,*> 
are  sufficiently  regular  operators,  then  L  defined 
in  (2.8)  will  be  a  linear  integral  operator  with  a 
locally  integrable  kerna)  function.  Under  suitable 
conditions  on  the  existence  of  solutions  to  (2.9). 
we  can  express  (2.9)  equivalently  by 


x  -  xL  ♦  N( x ) 


xL  -  (I-L)-'« 


N(x)  ■  (1-L)*‘g(5) 


(2.10b) 

(2.10c) 


The  signal  xj,  1s  referred  to  as  the  linearized 
error  system  .response.  Intuitively.  If  K(0)  •  0 
and  1 1 N( x l ) 1 1  Is  sufficiently  small,  then  ||x|| 
will  be  snail.  This  would  establish  that  the 
adaptive  system  behaves  very  nearly  like  the  tuned 
system.  This  result  can  be  rigorously  established 
by  appealing  to  the  Banach.Flxed  Point  theorem,  or 
as  it  is  often  referred  to.  the  Contraction  Happing 
Principle.  Before  it  is  stated  some  definitions 
are  needed. 

If  M  is  a  subset  or  a  Banach  space  B.  and  T  is 
an  operator  mapping  M»B,  then  T  is  a  contraction  on 
H  if  there  is  a  constant  o  c  [0,1)  such  that 


|Tx-Ty||  S  o||x-y||,  Vx,y  c  M 


The  constant  o  is  called  the  contraction  constant 
for  T  on  h.  A  fixed  point  of  T:H^  is  a  point  xcM 
such  that  x  -  Tx,  i.e.  xcM  is  invariant  under  T. 


Theorem  1 ;  Contraction  Mapping  Principle 
If  H  is  a  closed  subset  of  a  Banach  space  B  and 
T:M«M  is  s  contration  on  M,  then  T  has  a  fixed 
point  in  M.  o 


This  statement  of  the  Contraction  Happing 
Principle  can  be  found  in  Hale  (i960).  Applying  it 
to  the  error  system  (2.9)  gives  the  following 
result. 


Theorem  2:  Linearization 

Let  B  denote  a  Banach  apace  and  let  Br  denote 
the  closed  subset  of  B  defined  by 


Bf  ■  If  (  8  i  | | f | |  i  r)  (2.12) 

where  r  Is  I  positive  constant.  Assume  that: 


A! :  N(0)  •  0  (2.13) 

A2 1  1 1 N ( x ) -N ( y )  1 1  i  KNr||x-y||,  Vx,ytBr  (2.1k) 


Under  these  conditions,  if  r>0  satisfying 


(l)  KNr  <  ' 

til)  ||xL||  S  r(l-Kgr) 


then  a  unique  soluton  x  of  (2.9)  such  that 
||i|!  *  r  U.17) 


Remark :  There  are  eny  number  of  choices  for  the 
Banach  space  B  in  Theorem  2.  Useful  choices 
Include  the  Lebesgue  spaces  Lp(J),  with  the 
usual  norm  definitions,  and  the  apace  of 
continuous,  bounded  fucntions  C(J)  with  norm 
1 1 x 1 1 c  •  eupl |x(t)| :  tcJ),  where  typically  J  *  R* 
or  J  •  [0,T],  T  <  •.  By  Introducing  exponentially 
weighted  multipliers  on  the  operators  and  signals 
In  (2.9),  we  can  then  think  in  terms  of  a  norm  on 
C[0,TJ  such  that  ||x||c  *  sup(  |eux(t)| :  tiJ)  where 
1  is  a  chosen  positive  constant.  In  this  way  we 
obtain  convergence  rate  Information  about  the 
system  (2.9).  and  hence,  the  adaptive  system. 

Conditions  A1  and  A  2  establish  the  local 
Llpschltz  continuity  of  N(x).  Moreover, 
| | x | |  S  r  •>  | | N ( x ) | |  S  KNr*.  This  quadratic  bound 
on  N(x)  is  Insured  if  f(x)  is  Frechet 
differentiable  for  ||x||  S  r,  because  then  it 
follows  that  | | x | |f  i  r  *>  ||a(x)||t  -  0(||x|jf). 
The  truncated  norms  are  removed  under  the  action  of 
(1-L)“ 1  on  A(x).  I.e.  N(x)  -  (l-L)'lA(x). 


Proof:  Let  T  be  defined  by  the  right  hand  side  of 

(2.10),  i.e. 


T(x)  -  xL  ♦  (l-L)-‘A(x) 

Observe  that  from  (2.16),  T:Br*Br,  I.e. 
I |Tx| [  S  ||xL||  ♦  K„r*  S  r.  VxtBr 


Furthermore,  (2.16)  Implies  that  T  is  a  contraction 
on  Br,  i.e. 


||Tx-Ty||  S  Kgr  ||x-y||.  9x.ytBr 


with  contraction  constant  K^r  <  1.  By  Theorem  1, 
there  is  a  unique  fixed  point  xcBr  of  T.  Since 
x  •  Tx  is  equivalent  to  (2.10)  which  is  equivalent 
to  (2.9),  the  theorem  Is  proved.  □ 


Discussion 

Theorem  2  asserts  that  the  adaptive  system 
signal  trajectory  is  close  to  that  or  the  the  tuned 
system  (r- small)  If  the  linearized  response  x^  As 
small.  Observe  that  the  smallest  value  of  r  is 
limited  by  the  size  of  ||xl||  which  depends  on  the 
Interaction  of  (1-L)“*  with  6.  We  will  see  later 
that  6  varies  considerably  with  the  choice  of  tuned 
signals.  Also,  depending  on  this  choice,  both  L 
and  6  may  contain  integrators  and  thus,  although  x^ 
can  be  bounded,  neither  (1-L)"1  nor  6  may  be 
individually  bounded. 

Theorem  2  gives  qualitative  information 
regarding  the  effect  of  the  nonlinear  term  N(x). 
Condition  (2.17)  shows  that  the  stronger  the 
nonlinearity  (larger  Ku),  the  smaller  the  allowable 
1 1=1.11  - 

Beyond  this  type  of  Information,  Theorem  2  is  a 
little  opaque  because  the  Internal  dynamics  of  the 
operator  is  not  visible.  Even  so,  we  can  still 
regard  as  good  the  intuitive  idea  that  a  small 
linearized  error  response  la  assentlal  for  robust 
behaviour  near  a  tuned  trajectory. 

Before  proceeding  we  remark  on  the  possibility 
of  relaxing  the  conditions  of  Theorem  2.  This  can 
be  done  by  eliminating  the  uniqueness  requirement 
and  only  establishing  existence.  One  approach  is 
to  use  the  fixed  point  theorem  of  Schauder.  The 
following  theorem  statement  la  In  Hale  (19&0). 


Theorem  3:  Schauder  Fixed  Point  Theorem 
If  K  is  a  convex,  oompact  subset  of  a  Banach 
apace  B  and  T:M«M  is  continuous,  then  T  has  a 
fixed  point  In  M. 
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The  cost  of  eliminating  uniqueness  (T  oontrectlve  ‘r 
on  H)  is  that  H  Bust  now  be  a  oompaet,  convex  ••V- 

eubset  of  B,  whereas  In  Theorsa  1,  M  need  only  be  e  - 

closed  subset  of  B.  For  example,  auppose 
B  >  C[0,T],  the  Banach  apace  of  continuous,  bounded  ' 

functions  on  [0,T),  t<*.  Then  s  convex,  compact 
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»ut>set  of  M  la  the  bounded  end  slew  Halted 
functions  f cB,  i.e*  |f(t)|  i  r,  ¥tc(0,Tj  end 
|f(t)-f(i)|  i  k|l-i  | ,  Vl.icCO.T].  We  point  out  in 
Section  2  that  since  T  oust  be  finite,  Theorem  3  Is 
useful  for  e  transient  analysis  of  the  adaptive 
system. 

For  the  moment,  led  by  Theorem  I  and  Theorem  2, 
ue  will  examine  the  linearized  response. 


system  (3.5)  la  easily  seen  to  be  equivalent  to 
I  .  tpe  '  be.  1(0)  -  B.-8t(0) 

®  •  Hcw( St )u  ■  H|y ( 01 ) V 
0  •  H^w(0»)w  -  H^v( 0«)v 

»  •  •  v  -  ♦  (3.6) 


3.  THE  LINEARIZED  RESPONSE 

In  this  section  ue  will  examine  the  llnearlted 
system  (?.t0b),  l.e* 


where 

11^(61)  •  “  C^v6i(l*C^v6t)  *C^W 

Hgy(8»)  ■  Cfiy  "  Gf  y6l  (  l4C^y0l  )C|y 


In  order  to  be  more  specific  about  (3.1)»  It  is 
necessary  to  be  more  specific  about  the  structure 
of  <1.t)  and  the  choice  of  the  tuned  signals 

e* ,4*,6ft. 

3. 1  Tuned  System 

In  the  formulation  so  far,  the  choice  of  tuned 
signals  is  arbitrary.  The  most  common  choice  is 
where  6«  is  a  fixed  parameter  in  RP  chosen  as  If 
the  plant  were  known,  and  €■»+•  are  generated  as 
shown  in  Figure  2,  i.e.  from 

e«  |Hew(0«)w 

-  H(e.)w  •  (3.2) 

|h^w(0<»)w 

In  this  case  we  have  from  (2.6)  that 


|fi(e«#+»)  -  e.| 


H,w(e.)  -  (i*c,v«i)-'c4u 

H,y(e.)  -  {I«G,w»eT*C,„  (3.7) 

In  the  case  when  e" ,♦*  ,e*  arise  from  the  tuned 
system  In  Fig. 2,  where  e»cRpt  then  (3.6)  becomes 

4  -  epe,  5(0)  *  8»-6e 
e  -  e#  -  Hev( 8e)v 
♦  *  -  HtvU‘i)v 

v  -  t«'8  (3.6) 

The  linearized  system  associated  with  (3.8)  Is 
defined  by  (2.10),  (2.8)  is: 

ti,  •  c[»te.  »  paej,  ♦  ee8i,]>  6i.(0)  *  i. 


Hence,  the  size  of  the  linearized  response  Is 
determined  by  the  ability  or  tha  adaptation 
subsystem  to  hold  8  near  8a.  ' 

an  alternative  scheme  Tor  generating  e*,4i,8a 
Is  shown  in  Fig. 3.  In  this  case  6t  is  not  fixed 
but  is  the  output  or  an  adaptation  aubsystem.  The 
d  .Terence  between  the  system  in  Fig. 3  and  the 
system  in  Flg.t  is  that  {[(•)  and  w  are  ideal 
versions,  or  simplified  models,  or  H(-)  and  w  in 
F^g.l.  For  example.  Fig. 3  could  represent  an  ideal 
adaptive  system  which  is  globally  stable,  whereas 
Fig.1  contains  unmodelled  dynamics  and  disturbances 
which  remain  unaccounted  for  in  the  ideal  case.  In 
this  case  we  have 

Heu(8a)w  ■  Hew(  8>  )w 

4  •  HfM(8a)w  -  H,u(8.)5  (3.4) 

0 


I 


One  of  the  major  differences  between  the  4’a  in 
(3.3)  and  (3.<0  is  that  4  in  (3.4)  is  bounded 
wheres  4  in  (3.3)  may  not  be  bounded.  The  reason 
is  that  0  typically  contains  integrators.  However, 
in  both  oases  (I-L)*‘6  is  bounded,  again,  by  the 
action  or  the  Integrators  in  L,  i.e.  the  terms 
involving  Ifl/Sp. 


3.2  Adaptive  System  Structure 


Consider 
described  by 


adaptive 


(3.5s) 

(3.5b) 


with  adaptation  subsystem  given  by  the  simple 
gradient  algorithm 

0:  i  •  c8e,  8(0)  •  8.  (3.50 


The  operator  C  In  (3.5a)  1»  the  "plant"  and  la 
asauned  her#  to  be  a  Matrix  of  linear 
time- Invariant  operators  with  transfer  function 
C(s).  Using  the  definition  1  •  6-0t  from 


8b  -  -H,v(e.)(8«'6b)  (3.9) 

Observe  that  the  'inputs"  to  (3.9)  are  !•,<■  and 
it.  Since  e«  is  small  by  definition,  the  main 
cause  of  a  large  linearized  response  is  the  initial 
parameter  error  $•  -  Ba  -  Sa.  Since  Hev(6a)  and 
H,v( 8a)  are  stable  and  e*  is  small,  it  rollows  that 
the  stability  of  (3.9)  necessitates  the  stability 
of 

4l  -  -c  8*  Hev(8.)(8e'SL)  (3-10) 

This  system,  which  is  a  linear  time-varying  system, 
can  be  analyzed  by  passivity  methods  [Kosut  and 
Freldlander  0985)],  small  gain  theory  [Anderson  et 
al.  (1982)],  and  averaging  [Nledle  and  Kokotov lc 
0969)].  When  Hev(e»)  is  not  strictly  passive, 
which  is  the  normal  case  for  actual  systems,  the 
lstter  two  approaches  offer  slmiler  stability 
results,  namely,  if  c  <  0  is  sufficiently  email, 
and  if 

4|nln(*vg(8xHey( 8x)e«])  7  0  (3*11) 

then  (3.10)  la  exponentially  stable,  i.e.,  H  1  I, 
o>0  euch  that  solutions  of  (3.10)  satisfy 

|»t(t)|  *  Me-  «<t'!>|5L<T)|,  Vt-T  2  0  (3-12) 

If  condition  (3.11)  falls  to  hold  because  at  least 
one  eigenvalue  is  negative,  then  (3.10)  is 
unstable.  This  means  that  the  adaptive  ayetem  le 
locally  unstable. 

When  c>0  le  small,  the  adaptation  is  alow  and 
(3-9)  is  a  two-lime  acale  aystam  with  8L  the  alow 
variable.  Notice  that  ir  (3.11)  holds,  and  hence 
(3.10)  Is  exponentially  etiixv,  it  is  possible  to 
achieve  bounded  5l>*L.5l  ror  any  Initial 
parametrization  la.  f„ua,  any  Initial 

paramelrizatlon  8a  will  be  tolerated  by  the 
linearized  system.  In  the  actual  system, 
limitations  arise  from  tha  coupling  between  the 
linearized  system  and  the  neglected  nonlinear  terms 
as  la  evidenced  by  the  restrictions  on  ||i|,|| 


a 


ri 


by  (2.16).  Th«s«  can  be  partially  related 
by  initially  taking  into  account  the  two-tine  acale 
system  behaviour  and  developing  a  two-time  acale 
linearisation  theorem  analogous  to  Theorem  2.  Such 
results,  derived  from  the  method  of  averaging  of 
fiogolyubov  and  Hitropolaki  0955),  have  been 
developed  for  adaptive  systems,  e,g .  Aatrom  (198*0, 
Reldle  and  Kokotovlc  (1985b),  Bodson  at  al  (1985), 
Anderson  et  al  0986).  Qualitatively,  these 
results  assert  that  if  ft*  ia  a  stable 
parametrizatlon,  then  for  sufficiently  small  c>0, 
ft(t)  will  move  slowly  through  the  constant 
parameter  stability  set  -  provided  (3*11)  holds  - 
and  arrive  at  an  0(||e»||.)  neighborhood  of  a  tuned 
setting  6«.  Moreover,  o»  can  be  determined  from 
avg[e*$«J  •  0  provided  e«ft«  has  an  average  value. 

Note:  In  general,  the  exponential  stability  of  the 
linearized  system  will  insure  the  contraction 
property  required  in  Theorem  2.  How  to  ensure 
an  exponential  stabllty  in  the  linearized 
system  is  not  completely  solved,  but  at  present 
it  can  be  ensured  from  passivity,  small  gain, 
or  averaging  analyses,  see  for  example  Anderson 
ct  al  (1986).  The  interesting  aspect  of  the 
contraction  (fixed-point)  argument  is  that  the 
mechanism  for  the  contraction  need  not  be 
specified. 

In  the  case  when  e*,ft*,e«  arise  from  the  tuned 
system  in  Fig.  3,  then  (3.6)  can  be  written  as 

e  -  e*  ♦  6e  -  Hev( 6#) v 

ft  ■  ft*  ♦  6^  -  H^v(6«)v 

v  •  ft'l 

4  -  c[,e-,.c]  ,  8(0)  -  0.  (3.13) 

Where  Ae.A,  *r«  given  by  (3.9),  l.«. 

A,  —  Hm(  0.  )w  ■  Hew( 6a )  w 

a,  -  h,u(8.)w  -  fi,w(e.)S  (3.16) 

'The  linearized  systems  associated  with  (3.13)  as 
defined  by  (2.10).  (2.6)  is: 

®(.  •  A.  “  H.y  (  8.  )  (  ,.8^,) 

♦(,  ■  A,  -  H,y(e.)(8.8(,) 

*  ••♦lI.  ’  0  (3.15) 

Comparing  (3.15)  with  (3.9),  the  operators  Key(8»), 
H4y(8.)  In  (3.15)  are  linear  and  time-varying, 
because  8>  is  time-varying  (Fig. 3),  whereas  in 
(3.9)  the  operators  are  linear  time-invariant 

because  8.  Is  fixed  (Fig. 2).  Another  difference  Is 
that  the  "Inputs"  in  (3.19)  are  usually 

significantly  smaller  that  In  (3.9).  In  (3.19)  the 
Inputs  are  Ae,A,  from  (3.19),  whereas  In  (3,9)  the 
inputs  are  ,.e>  and  8«.  Certainly  e*  is  small 

because  e»  Is  email.  Also,  Ae,A,  can  be  small  if 

the  ideal  system  (fig. 3)  1»  not  significantly 

different  from  the  actual  system  (Fig. 2),  However, 
the  'nltlal  parameter  error  8.  can  be  quite  large 
and,  as  already  discussed,  can  limit  the  region  of 
linearization  validity,  e.g.  (2.16),  whereas  In 
(3.15)  tne  effect  of  8.  as  an  Input  is  subsumed  by 
the  linearization  about  the  tuned  trajectory  of  the 
ideal  adaptive  system  (Fig. 3). 

The  stability  analysis  of  (3.19)  proceeds  In 
much  the  same  way  as  the  analysis  to  (3.9).  Two 
phases  of  the  analysis  can  be  distinguished,  There 
la  first  a  transient  phase,  during  whloh  8.  H  not 
necessarily  moving  slowly  aven  if  c  la  small.  For 
sxample,  when  the  initial  parametrizatlon  ia  naar 
Instability  tt  Is  quite  possible  that  the  behaviour 
of  6a  ia  erratic  -  such  behaviour  has  bean  seen  in 
simulations.  After  the  transient  phase,  8.  may 


then  move  slowly  and  begin  to  approach  a  constant 
tuned  setting.  Let  us  first  consider  the  terminal 
phase,  after  the  transients  have  disappeared. 

Suppose  that  as  t*«,  Oa ( t )-0a cKP,  where  8a  ia 
like  the  tuned  parametrizatlon  In  Fig. 2.  In  the 
idea)  case  we  may  have  e>(t)«0  and  e>(t)u,<(l) 
exponentially  fast,  where  e.(t)  la  like  ,»(t)  in 
Fig. 2.  Thus,  neglecting  the  effect  or  the 
exponenltially  decaying  terms,  (3.19)  becomes 

4L  "  t*"*L  .  "  0  (3.16) 


”  H|»y  ( 6* )  ( ftift^) 

♦L  "  *ft  "  **^y(0O«)(ft*Bi<) 


$e  "  8* )M 

«ft  “  H#w(e«)w  -  H*w(e«)w  (3-17) 

Since  Hev(e«)  and  HfV(e«)  are  stable  LTI  operators, 
the  stability  of  (3.16)  and  (3-15)  depends  on  the 
stability  of 

4L  -  -  t8.Hev(8.)(?iei,)  (3.18) 

which  is  precisely  the  system  In  (3.10).  and  so  the 
methods  of  analysis  discussed  after  (3.10)  also 
apply.  To  reiterate,  the  main  difference  between 
(3-1S)  and  (3,9)  is  the  Input  magnitude.  Hence, 
11*1.11  Is  much  smaller  In  (3-19)  than  in  (3.9).  and 
so  the  limitation  on  linearization  validity  as 
expressed  in  (2.16)  Is  more  easily  satisfied. 

9,  TRANSIENT  ANALYSIS 

The  drawback  of  carrying  out  the  analysis  of 
the  error  system  described  by  (3-13)  Is  that  the 
properties  of  the  error  signals  e,#,S  depend  on  the 
properties  or  e. , 8.  as  generated  from  the  ideal 
adaptive  system  in  Fig. 3.  Although  a  global 
analysis  of  such  systems  is  available  [see,  e.g. 
Narendra,  Lin  and  Valavanl  (1980),  Kosut  and 
Frledlander  (1989)]  the  results  are  qualitative  and 
the  signal  bounds  are  coarse.  Thus  a  detailed 
description  or  the  Ideal  system  behaviour  Is  not 
available.  Since  the  averaging  analysis  referenced 
in  Section  3  can  describe  the  system  behaviour  In 
some  detail  for  small  c>0  with  stable  Initial 
parametrlzations ,  it  follows  that  a  transient 
analysis  of  the  Ideal  system  (Fig. 3)  Is  needed  for 
other  initial  parametrlzations.  This  Includes  both 
stable  and  unstable  initial  parameterizations.  At 
tne  same  time  we  need  a  transient  analysis  when  c 
is  not  necessarily  small,  or  as  small  as  required 
by  averaging  theory,  or  when  c  switches  from  large 
to  small  values,  e.g.  as  In  recursive  least  squares 
with  a  constant  forgetting  factor.  In  this  latter 
case  the  adaptation  mechanism  can  be  written  as  in 
LJung  and  Soderstrom  (1983), 

8  -  cR'‘»e  8(»)  -  6. 

R  -  c(8A'-R),  R(«)  -  R(  *) '  >  0  (9.1) 

Here  c  is  small  but  the  choice  of  R(0)  •  (c/o)l 
with  large  o  >  0  results  In  an  effective  large 
initial  gain  ol .  Other  schemes  can  also  be 
envisioned  when  t  in  (3.9c)  changes  size  depending 
on  the  size  of  e(t)  on  some  measure,  e.g.  ||e|| 
over  some  time  window  [t-T,t],  etc. 

The  Contraction  Mapping  Principle  (Theorem  1) 
can,  in  principle,  be  applied  to  the  transient 
analysis  problem.  Suppose  that  the  ideal  system 
(Fig. 3)  le  given  by 

4a  ■  -  C8a,a6a  ,  8#(0)  •  6(0)  -  8« 

8*  *  A  a  —  H,(0a)(,a6e) 
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Ttw  not Ion  is  a  tit  awkward,  tut  6«  •  e*-$«  wh err 
da  is  Um*  tariff  ti»«~  varying  astinat*  in  Fig.  3;  ia 
is  tiic  fixed  fip>«i  value  Tor  which  •*  -  Oj_aiid  #• 

is  the  final  rtgresso  function  when  da  •  da .  In 

order  to  utilize  In*  ora*  )  to  •atabllsh  the 
transient  behaviour  of  (*.?)  it  is  necessary  to 
Identify  the  operator  T  and  the  closed  space  H. 

For  example,  one  choice  is  to  define  T  as  the 

sapping  a«o  defined  implicitly  by 

d  *  -Cdd'®.  «<0)  •  |(0)-da 

d  -  -  Nf  ,(•«)(#»  <*.3) 

The  set  M  can  be  chosen  as 

H  -  UcCto.T]:  ||«-eB||  *  1.  ig( t)  - 


||f||H-  *up  |f(t)| 
tflO.T] 

where  BcRP*P  with  ReX(B)  <  0.  The  relationship* 
between  B.T,  and  n  are  critical  In  the  transient 
and  will  be  dlacuaaed  In  the  presentation  as  space 
In  the  paper  Is  Halted.  Note  also  that  the 
transient  analysis  can  be  carried  out  by  appealing 
to  the  Scnauder  Fixed  Point  Tneorea  (Theorem  3) 
where  the  space  H  above  is  aodlfled  so  that  •  Is 
slew  United,  l.e.  |8(t)  -  a(i)|  1  k|t-i), 
Vl.icCO.TJ.  Then  M  la  a  conpact,  convex  subset  of 
C[0,T]  as  discussed  after  the  statement  of  Theorss 
3.  In  this  latter  case  we  are  only  able  to 
establish  existence.  However,  if  the  nap  T  is 
locally  contractive  on  [0,T],  then  uniqueness  la 
also  established. 


CONCLUSIONS 

Two  fixed  point  theoren  (FPT)  on  Banach  spaces 
have  been  presented  alongside  an  operator 
formulation  of  adaptive  control  In  terns  of  signals 
O.e.e  -  collectively  denoted  x.  The  Banach  spaces 
under  consideration  are  various  function  apaesa 
fx(t):tcT)  tor  T  being  the  tine-index  set  and 
conpact  subsets  In  this  apace  correspond  to 
qol lections  or  neighbouring  (under  the  appropriate 
norm)  tine  functions.  A  fixed  point  of  the 
operator  equations  or  adaptive  control  la 
identified  with  the  complete  time  history  of 
x(*s,g,e)  for  the  adaptive  control  problem.  The 
role  of  the  FPT's  Is  to  allow  derivation  of 
conditions  for  these  signals  to  rsnaln  close  as 
functions  of  tine  to  nominal,  well-behaved 
trajectories  gv, dx.ee. 

The  formalism  of  appealing  to  these  FPT's 
dictates  that  a  formulation  such  as  (2.3)  la 
achieved  for  an  error  system.  Our  approach  to  this 
is  to  Involve  a  linearization  about  our  nominal 
values.  Local  contractlvlty  of  the  nonlinear 
operator  equation  is  Implied  by  exponential 
stability  of  the  linearized  adaptive  control 
problem.  Thta  implication  la  at  the  very  heart  of 
our  method.  Smallness  of  the  additive  operators 
1  due  to  unaodeled  dynamics,  linearization,  ate,,  is 
then  Invoked  to  prove  the  good  behaviour  of  the 
adaptive  control  system  vis  the  FPT, 

It  is  clear  that  there  are  many  aspects 
affecting  the  quantitive  application  of  these  ideas 
-  should  anyone  ever  dees  this  appropriate  -  and 


foremost  among  these  is  tv  ctiuice  of  tun  7 
signals.  Two  particular  natutal  choices  fur 
signals  we  advene.  .  which  have  respective 
advent  ages.  Oilier  ei  facie  are  Swell  Ih.na*  as 
sen*  1 1 1  v  1  ty  of  tha  com  rolled  plant  to  parameter 
variations  about  *•  (connected  with  the  va;  dlly  of 
linearization),  .dual  magnitude  ohoeen  for  the 
gain  i  (reflecting  the  trad'  off  between 
contraction  oonstant/eiponentlsj  degree  of 
sterility  and  perturbation  magnitude),  choice  of 
reference  signals,  ate.,  which  reflsc  the 
Influence  of  the  actual  plant,  apeclfisd  t  ntrol 
objective  and  law,  and  specific  adaptation  rule  on 
the  behaviour  o.'  the  complete  adaptive  aystam.  Our 
thesis  la  that  tha  an.iyala,  via  a.g.  FPT's,  of  tha 
signal-baaed  operator  formulation  deptetad  In 
Figura  I  la  a  particularly  natural  and  flaxlble 
approach  to  th.  analysts  of  robust  adaptive 
control.  This  echmque  provioes  considerable 
Insight  Into  the  qt.litallte  Issues  Implicated  in 
achieving  robust  adaptive  control.  The 
quantitative  issue*  require  very  much  more  specific 
site  Information  (Foubell  at  ml.). 
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ABSTRACT 


The  rate  of  parameter  convergence  in  a  number  of  adaptive  estimation 
schemes  is  related  to  the  smallest  eigenvalue  of  the  average  information 
matrix  determined  by  the  regression  vector.  Using  a  very  simple  example, 
we  illustrate  that  the  input  signals  that  maximize  this  minimum  eigenvalue 
may  be  quite  different  from  the  input  signals  that  optimize  more  classical 
input  design  criteria,  e.g.  D-optimal  criterion. 


Key  words:  Exponential  convergence,  persistence  of  excitation,  experiment 
design 


1.  PREAMBLE 


listen:  The  concept  of  "persistently  exciting"  (PE)  signals  has 

invaded  the  adaptive  systess  literature  at  an  exponential  rate.  Currently, 
■any  papers  on  adaptive  estimation  or  adaptive  control  contain  long 
derivations  proving  that  there  exists  sone  T>0,  some  to>0,  and  some  a>0, 
P>0  such  that  a  certain  regression  vector  Pit)  satisfies  the  following 
condition 

t+T 

PI  >  |  |  s(t)*T(t)  dr  >  otl  for  all  t>t0.  (1) 

t 

This  is  the  celebrated  persistency  of  excitation  condition.  The  regression 
vector  a(t)  can  take  many  forms,  depending  on  the  problem,  but  the 
following  form  is  typical: 

♦f(t)  =  - ! - -  Cu(t>  y(t)  y(t). . (t)3 

(**r,n  (2) 

We  have  assumed  a  single  input  single  output  (SISO)  system  for  simplicity, 

with  input  u(t)  and  output  y(t);  y  is  a  positve  constant  and  n  is  the  order 

of  the  system. 

It  is  beyond  the  reach  of  this  short  technical  note  to  dwell  on 
the  many  occurrences  of  the  persistency  of  excitation  condition  in  adaptive 
estimation  and  adaptive  control  theory,  but  for  those  readers  unfamiliar 
with  this  field  let  us  just  say  that  this  condition  is  often  appealed  to  to 
establish  the  exponential  convergence  of  a  linear  time-varying  error 
system.  This  insures  the  exponential  convergence  of  all  internal  variables 
to  their  desired  values  in  the  idealized  case  (constant  system,  exact  model 
matching,  etc.)  and  their  boundedness  in  certain  non-ideal  cases 
(time-varying  parameters,  unmodelled  dynamics,  etc).  And  so  it  goes. 

The  simplest  and  most  informative  occurrence  of  the  PE  condition 
is  in  the  analysis  of  gradient  algorithms  for  the  estimation  of  a  parameter 
vector.  The  error  equations  have  the  form 


where  eft)  is  the  parameter  estimation  error,  C>0  is  the  adaptation  gain, 
♦(t)  is  the  regression  vector  and  e(t)=-*T(t)e(t)  is  an  error  signal.  It 
can  be  shown  that,  subject  to  0  satisfying  (1),  (3)  is  uniformly 
exponentially  convergent  to  zero.  Further,  if  additionally  c  is  small 
(actually  cpT<<l)  then  the  convergence  rate  of  (3)  is  bounded  below  by 


where 


k  =  c  >_in(R)  +  0(e*) 


R  -  lim  4  [  0(T)eT(T)dT 
T-w»  1  1 


assuming  this  limit  exists.  That  is 
|e(t+r)  (  <  Ke-^left)! 

for  all  t  and  r  and  some  1<K<«»  fixed,  with  k  being  approximately  linear  in 
e  [1,2,3]. 

One  is  then .  drawn  to  ask  how  to  maximize  the  convergence  rate  of  the 
error  system  (3)  by  manipulating  designer  variables  -  specifically  c  and  0. 
In  most  adaptive  situations  the  algorithm  gain  e  is  constrained  to  be  small 
relative  to  the  regressor  magnitude  by  the  requirements  of  noise  rejection 
-  the  variance  of  the  parameter  error  in  adaptive  filtering  is  typically 
proportional  to  cP**  [4]  -  so  that  the  small  e  assumption  concurs  with 
engineering  dictates.  The  meaningful  subproblem  then  is:  given  that  c  is 
already  small,  how  can  we  best  choose  0  to  achieve  maximum  convergence  rate 
or,  more  fully,  with  0  determined  by  (2)  how  should  we  choose  u(t)? 

Our  aim  in  this  paper  is  not  to  develop  broad  new  frontiers  in  the 
robustness  of  adaptive  systems,  on  which  entire  books  could  be  written  [1], 
but  rather  to  analyse  critically  the  PE  condition  itself  with  a  view  to 
answering  some  of  the  practical  questions  raised  above.  The  overwhelming 
body  of  work  so  far  has  been  algebraic  in  nature  in  establishing  conditions 
for  the  regression  vector  0  of  some  particular  adaptive  system  to  satisfy 
(1)  for  some  a  and  P.  Our  discussion  above  shows  that  it  makes  good  sense 
to  keep  CP  (or  CP **)  small,  and  that,  if  CJJT  is  sufficiently  small,  the 
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convergence  rate  is  proportional  to  ea.  It  follows  that  it  ia  desirable  to 
generate  regressors  m(t)  that  will  ■  axial  re  «  and  The  question  we 
want  to  investigate  is:  "How  can  we  achieve  this  by  a  proper  choice  of 
u(t)?". 

This  ia  clearly  an  optinal  input  design-type  question.  Input  design 
ms  sore  fashionable  a  decade  ago  in  the  system  identification  literature, 
where  persistency  of  excitation  also  originated.  However,  most  of  the 
effort  was  aimed  at  maximizing  the  determinant  of  the  information  matrix 
(this  is  called  O-optimality),  rather  than  its  minimum  eigenvalue,  or  the 
inverse  of  its  condition  number.  In  this  note  we  examine  the  very  simple 
case: 

u(t)|  . 

*(t)  =  .  H(s)  =  -2-  ,  y(t)  =  Hfs)u(t) 
y(t')  I  S+a 

and  we  solve  the  optimal  input  design  problem  for  three  different  criteria. 
We  seek  the  input  u(t)  that  maximizes,  respectively,  'min^* 
^Snin^^max^  •  ®nd  T°r  comparison  purposes,  det  R.  Given  the  connections 
we  have  established  with  the  convergence  rate  of  an  adaptive  algorithm,  we 
shall  show  that  maximizing  the  determinant  leads  to  a  rather  poor  input 
design.  The  main  reason  for  its  popular  use  is  probably  the  simplicity  of 
computation  of  the  optimal  input. 

Some  of  these  issues  of  experiment  design  in  an  adaptive  systems 
context  have  been  raised  before  [5}  simply  to  emphasize  the  connection. 
Here  we  stress  the  unexpected  difference  between  adaptive  experiment  design 
and  optimal  off-line  experiment  design. 

One  reason  for  our  interest  in  this  question  arises  from  experimental 
attempts  to  generate  PE  signals  for  simple  linear  systems,  with  an  adequate 
"richness"  of  the  regression  vector  *(t>  leading  to  a  particular  minimum 
convergence  rate  of  the  error  variables.  It  is  often  thought  that  only  an 
academic  researcher  with  a  very  twisted  mind  could  generate  signals  that 
will  violate  the  PE  condition  (e.g.  try  u(t)  =  cost2  going  through  a  low 


pass  filter).  As  it  turns  out,  the  problem  is  not  to  generate  a  •( t )  that 
satisfies  (1)  for  some  a>0,  but  to  obtain  an  et  that  is  large  enough  to 
produce  a  reasonable  convergence  rate  for  the  adaptive  algorithm.  In  other 
words:  Row  does  one  turn  exponentially  slow  convergence  into  exponentially 
fast  convergence?  Finally,  we  wish  to  mention  that  we  are  by  no  means  the 
first  to  discover  that  exponential  convergence  can  be  exceedingly  slow,  and 
some  authors  have  conjectured  that  the  slow  convergence  was  probably  due  to 
a  poor  choice  of  input  signal  (see  e.g.  [6]).  Given  the  practical 
importance  of  the  question  we  raise,  it  is  surprising  that  almost  no 
attempts  have  been  made  to  answer  it.  The  purpose  of  this  note  is  to  give 
some  very  preliminary  answers  based  on  the  analysis  of  the  simplest 
possible  case.  We  believe  that  our  results  provide  a  lot  of  insight,  at 
least  for  us,  which  may  help  crack  the  more  general  case. 


input . 
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Obviously  (2.5)  and  (2.2)  imply  the  existence  of  R  (2.3),  moreover  R  is 
given  by: 


4m 

4m 

J  S(u)du 

|  S(w)ReH( ju)dj 

-• 

-m 

4m 

4m 

|  S(u)ReH(ju)du 

f  S(u)lH(ju)|*du 

-• 

- m 

(2.6) 


where  in  this  case 

(2.7) 

(2.8) 

We  compare  the  following  three  input  selection  criteria: 


»'"<*>  =  |  ^ 
~~7  *  1 
a2 

=  £  S53— 
7?  *  1 


Selection  Criteria: 

Over  the  class  of  input  functions  (u(t),  tcR4-),  which  have  a  power 
spectrum  (as  defined  in  (2.4)-(2.5))  and  which  satisfy  the  constraint: 


S(u)dJ  <  1 


maximise,  either 

(Cl)  det(R) 

or 

(C2)  *Bin(F) 

or 

(«)  >min^)amax^)- 


(2.9) 


Solution 


Define 


.'•>*»  w  »  i.  t»  i  •  '  .  w 
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/  *M  gr^ 

-»  a2 

4«t 

|  S(u)dw 


dw 


and 


M(«)  = 


<£ 

a 


c£ 

a 


b2 


Surprisingly,  we  have  that 


R  =  M(a)  |  S(u)dw 


and 


(2.10) 


(2.11)  + 


(2.12) 


0  <  a  <  1  (2.13) 

Therefore,  the  optimal  input  functions  acording  to  (Cl)  or  (C2)  satisfy 
(2.9)  with  equality,  whilst  for  (C3)  the  magnitude  of  the  total  input  power 
is  immaterial.  Consequently,  the  optimal  inputs  are  characterized  as: 


Cl-optimal:  f  S(u>  -j— ^ —  du  =  f  S(u)dw  =  1 

J  --y  +  1  J 


(2.14) 


C2-optimal:  f  S(w)  * —  dw  =  af;  f  S(u)dJ  =  1 

J  +  1  J 
-•a2  “ 


'2. IS) 


+• 


C3-optimal:  f  S(w)  -y-i —  du  =  or$s:  [  srw)dw  =  $  <  1  f 2. 16) 

J  ^  +  1  J 
-•a2  -• 


where  the  «Jc(0,l)  maximise  respectively  detM(a),  \iin^a)  and 
^mii/^^max^*)  over  oo e(0,l):  and  0  is  any  number  in  (0,1]. 


t  This  matrix  M  should  not  be  confused  with  the  M  matrix  of  Poubelle 
et.al. [61 . 


After  some  simple  calculations,  we  arrive  at: 


a}  =  1/2 


c  (0,H) 


(2.17) 

(2.18) 


•*■3* 


€  (0,1) 


(2.19) 


2^1 


Equations  (2. 14)-(2. 19)  characterize  all  "optimal"  solutions.  In  order  to 
get  some  more  insight,  we  verify  whether  there  exist  optimal  inputs  of  the 
form 


u(t)  =  J2  cosu*t,  t  €  R*,  u*  e  R* 
with  power  spectrum: 

S(ui)  =  H(S(u-u*)+S(uhj*)) 

Solving  for  cj*  we  find  respectively  for  Cl,  C2  and  C3: 


(2.20) 

(2.21) 

(2.22) 

(2.23) 

(2.24) 


For  this  type  of  input  (2.20)  we  collected  in  Table  1  the  relevant 
quantities  (det  R,  *nin(R),  xmin(R- /*ma.\(R^  •  as  a  function  of  b  and  a*  In 
Table  2,  the  same  quantities  are  displayed  for  h/a  =  1. 


For  the  purely  sinusoidal  input  u(t)  (2.20),  the  dependence  of  the 
determinant  of  R  on  the  frequency  u  is  displayed  in  Figure  1.  (Notice  that 
the  determinant  is  normalised  by  the  D.C.  gain  squared.)  The  minimum 
eigenvalue  and  the  condition  number  are  displayed  as  functions  of  frequency 
respectively  in  Figure  2  and  3.  The  full  line  corresponds  to  a  D.C.  gain 
of  10,  whilst  the  dotted  line  correponds  to  a  D.C.  gain  of  1. 


3.  DISCUSSION 


Consideration  of  input  sequences  that  max ini ze  the  minimum  eigenvalue 
of  R  (as  in  criterion  (C2)  of  the  preceding  section)  is  encouraged  in  the 
introduction.  This  point  is  argued  persuasively,  and  with  more  detail,  in 
[5].  What  is  not  examined  in  [5]  is  the  difference  in  the  subsequent  input 
choice  relative  to  the  more  common  objective  of  determinant  maximization 
[8j.  As  stated  earlier,  the  purpose  of  this  note  is  to  draw  attention  to 
this  difference  via  examination  of  a  simple  example. 

1.  The  most  immediate  observation  is  that  maximizing  the  minimum 
eigenvalue  of  the  information  matrix  yields  a  different  "optimal"  input 
sequence  from  the  one  derived  by  maximizing  the  determinant  or  the 
ratio  of  the  minimum  and  maximum  eigenvalues. 

2.  In  the  first  order  example  of  Section  2,  the  frequency  w*  of  the 

selected  sinusoid  is  the  breakpoint  (or  3dB)  frequency  of  the  plant  in 
(2.2)  with  determinant  maximization;  while  w*  is  larger  for  minimum 
eigenvalue  and  minimum-to-maximum  eigenvalue  ratio  maximization 
objectives.  In  fact,  as  the  D.C.  gain  (Va)  of  the  plant  increases,  so 

do  the  selected  input  frequencies  for  the  minimum  and  minimum/maximum 

eigenvalue  maximization  criteria. 

3.  One  interpretation  of  the  tradeoff  inherent  with  u*  selection  for 

minimum  eigenvalue  maximization  is  its  tendency  to  make  R  in  (2.12),  or 

equivalently  M(a)  in  (2.11),  equal  the  identity  matrix  by  attempting  to 

keep  the  plant  gain  close  to  one  while  simultaneously  attempting  to 
achieve  a  90'  phase  shift  in  order  to  null  the  off-diagonal  terms  on 
average.  Table  2  indicates  the  compromise  between  these  conflicting 
objectives  when  *Va  =  j.  This  interpretation  also  explains  why  the 
input  frequency  that  maximizes  Xn)jn(R)  increases  as  the  plant  D.C.  gain 
increases. 

4.  We  should  also  note  the  nonuniqueness  of  the  "optimal"  u,  unless,  as  in 
our  example,  the  input  power  is  constrained  and  {u}  is  assumed  to  be 


composed  of  a  number  of  sinusoids.  We  refer  to  [8,  Chapter  6]  for 
further  conent . 

5.  One  extension  of  "optimal"  input  selection  would  be  to  incorporate  a 
measure  of  sensitivity  to  imprecisely  known  plant  parameters.  Such 
plant  model  imprecision  is  actually  the  motivation  for  identification 
procedures  and  the  associated  input  selection.  The  example  in  Section 
2  clearly  indicates  that  the  "optimal"  input,  by  the  various  criteria, 
is  a  function  of  the  "unknown"  plant  parameters. 

6.  We  note  that  in  this  example  the  sensitivity  of  the  \)jn  design 
criterion  is  better  than  for  the  determinant  criterion  as  is  clear  from 
Figures  1  and  2.  Further,  for  all  three  criteria  the  penalty  for  using 
higher  than  optimal  frequency  appears  less  than  that  for  using  a  lower 
frequency  than  optimal,  and  the  sensitivity  for  the  >njn  criterion  in 
this  example  improves  with  increasing  D.C.  gain.  This  criterion  is  the 
one  of  prime  interest  for  the  convergence  rate. 

Finally,  as  discussed  in  [5],  optimal  input  design  questions  are 
perhaps  better  posed  in  an  adaptive  estimation  context  than  in  an  off-line 
identification  situation.  This  is  because,  as  the  adaptive  identifier 
learns  more  about  the  system,  the  input  signals  can  be  adjusted  according 
to  the  relevant  criterion.  The  insensitivity  to  imprecise  knowledge  of 
system  parameters  is  then  clearly  advantageous  and  of  relevant  concern. 

4.  CONCLUSIONS 

We  have  argued  that,  in  the  case  of  slow  adaptation,  the  smallest 
eigenvalue  and  the  condition  number  of  the  average  information  matrix 
determined  by  the  regression  vector  should  be  considered  as  input  design 
criteria  in  order  to  maximize  the  rate  of  exponential  convergence.  We  have 
then  performed  this  optimal  input  design  in  the  simplest  possible  case, 
which  allows  a  complete  description  of  all  optimal  solutions.  One  should 


*  be  very  careful  in  extrapolating  the  conclusions  of  this  simple  example  to 

i  more  general  situations,  but  we  believe  that  the  main  merit  of  our  note  is 

i  to  draw  attention  to  this  problem  because  it  follows  from  our  analysis 

|  that : 

j  1.  The  optimal  inputs  that  result  from  our  design  criteria  are  quite 

1  different  from  those  obtained  using  the  classical  D-optimality 

i 

I  criterion  for  optimal  input  design  in  off-line  parameter 

identification. 

2.  In  some  adaptive  control  schemes  it  is  presently  being  advocated  to 
|  concentrate  the  input  signals  in  low  frequency  regimes  in  order  that  an 

average  signal  positivity  condition  (related  to  strict  positive 
realness)  is  achieved.  The  results  here  indicate  that  this  may  cause 
an  attendant  decrease  in  the  level  of  persistence  of  excitation. 

Given  the  practical  importance  of  optimizing  the  rate  of  parameter 
convergence,  and  given  that  this  preliminary  analysis  points  in  a  direction 
opposite  from  presently  prevailing  ideas,  we  believe  that  this  problem 
deserves  much  more  attention. 


"Nyuk,  nyuk,  nyuk"  -  Curly  Howard 
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ABSTRACT 

A  method  of  averaging  is  developed  for  the  stability  analysis  of  linear  differential 
equations  with  small  time-vaiying  coefficients  which  do  not  necessarily  possess  an 
average  value.  The  technique  is  then  applied  to  determine  the  stability  of  a  linear 
equation  which  arises  in  the  study  of  adaptive  systems  where  the  adaptive  parameters  are 
slowly  varying.  The  stability  conditions  are  stated  in  the  frequency-domain  which  shows 
the  relation  between  persistent  excitation  and  unmodeled  dynamics. 
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1.  INTRODUCTION 


y 


For  a  large  class  of  adaptive  feedback  systems,  as  well  as  for  some  output  error 
identification  schemes,  a  stability  analysis  in  the  neighborhood  of  the  desired  behavior 
leads  to  investigating  the  stability  of  the  following  linear  system  of  differential- 
operator  equations  (see  e.g.,  [1]-[3J,  [20]) 

0  =  £[/■-$«(< t»'0)3  (1.1a) 

where  0(0)  =  0O  e  IRP,  e  is  a  positive  constant,  /[•),  <(>(•):  IR+  -*  JRP  are  regulated 
and  bounded,  and  H  is  a  linear-time-invariant  convolution  operator  with  kernel  h(t) 
and  transfer  function  H(s),  i.e., 

i 

=  J  h(t^i)u(z)dx  (1.1b) 

o 

We  consider  the  case  when  H(s)  is  strictly  proper  and  exponentially  stable,  thus,  h(r) 
is  bounded  by  a  decaying  exponential.  The  strictly  proper  assumption  is  not  necessary 
for  analysis,  but  it  is  more  often  the  case  when  (1.1)  arises  from  dynamical  systems. 
The  same  can  be  said  for  considering  the  general  convolution  (1.1b)  and  not  just  the 
case  of  rational  H(s). 

The  specific  problem  we  consider  is  slow  adaptation  (small  e  >  0),  and  to 
determine  sufficient  conditions  for  which  the  map  tf,%)  — >  0  defined  implicitly  by 

(1.1) ,  is  exponentially  stable,  i.e.,  there  are  positive  constants  K,  a  such  that 

t 

|0(r)|  <  j  +  Ke^\  0O  I  (1.2) 

o 

When  such  a  condition  exists,  it  then  follows  that  the  adaptive  systems  from  which 

(1.1)  arose  is  locally  stable. 

Linearization  and  Local  Stability 

In  [2],  for  example,  system  (1.1)  is  obtained  as  a  result  of  linearization  of  the 
adaptive  system  in  the  neighborhood  of  a  “tuned”  system,  i.e.,  a  system  where  the 
adaptive  parameters  are  set  to  a  constant  value  0*  £  1RP  and  whose  behavior  is 
deemed  acceptable.  Hence,  in  (1.1),  0(0  is  the  vector  of  parameter  errors  between  the 
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parameter  estimate  at  time  t  and  the  tuned  value  6*,  4>(/)  is  the  regressor  vector 
from  the  tuned  system  (e.g.,  filtered  revisions  of  measured  signals),  and  the  scalar  e 
is  the  magnitude  of  the  adaptation  gain  which  essentially  controls  the  rate  of 
adaptation.  The  operator  H  depends  on  the  actual  system  being  controlled  or 
identified  and  also  on  the  tuned  parameter  setting  6*. 

It  is  shown  in  [2,3]  that  if  system  (1.1)  is  exponentially  stable,  then  the  adaptive 
system  is  locally  stable,  i.e.,  the  adaptive  system  behavior  will  remain  in  a 
neighborhood  of  the  desired  behavior  provided  the  initial  parameter  error  0(0)  and 
the  effect  of  external  disturbances  are  sufficiently  small.  Although  the  results  in  [2,3] 
were  arrived  at  using  input-output  properties  [16],  the  local  stability  property  also 
follows  from  the  results  on  “total”  stability  [4],  [20]. 

Unmodeled  Dynamics  and  Slow  Adaptation 

In  the  ideal  case  there  are  a  sufficient  number  of  adaptive  parameters  (the  number 
p)  such  that  the  tuned  parameter  setting  results  in  H(s)  being  strictly  positive  real 
(SPR),  i.e.,  R cH(  j(0)  >  0,  V  to  e  IR+.  Under  these  conditions,  we  have  the  following 
results  (see  e.g.,  [5]-[8],  [1]):  (1)  system  (1.1)  is  stable,  i.e.,  0(r)  is  bounded  but  not 
necessarily  constant;  (2)  if,  in  addition,  $(r)  is  persistently  exciting,  then  system  (1.1) 
is  exponentially  stable.  The  trouble  starts  when  there  are  an  insufficient  number  of 
parameters  to  obtain  H(s)  €  SPR,  as  is  the  case  in  adaptive  control  when  the  plant 
has  unmodeled  dynamics  (see  e.g.,  [2,7],  [12]). 

In  this  paper  we  will  examine  the  stability  of  (1.1)  when  e  is  small,  <j>(r)  is 
persistently  exciting,  and  H(s)  is  not  necessarily  SPR  but  only  exponentially  stable. 
Riedle  and  Kokotovic  [9]  refer  to  this  case  as  “slow  adaptation”  and  by  using  the 
method  of  averaging  described  by  Hale  [10],  they  show  that  the  stability  of  (1.1)  is 
critically  dependent  on  the  spectrum  of  the  excitation  in  relation  to  the  frequency 
response  H(jca).  With  the  same  assumptions,  Astrom  [11]  uses  averaging  techniques 
to  analyze  the  interaction  between  unmodeled  dynamics  and  external  inputs  in  the 
counter-example  posed  by  Rohrs  et  al.  [12].  Both  these  analyses  require  the 
assumption  that  $(r)  is  almost  periodic  and  that  H(s)  is  rational.  In  this  case  Riedle 
and  Kokotovic  [9]  show  system  (1.1)  is  exponentially  stable  if 

xf  £  [a(to)a(©)*]ReH( >©)]  >  0  (1.3) 
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where  ft  and  {a(a>),  to  e  ft}  are,  respectively,  the  Fourier  exponents  and 
coefficients  of  Condition  (1.2)  can  be  considered  as  a  signal  dependent  positivity 
condition,  but  unlike  the  SPR  condition  R e//(  ji o)  is  not  required  to  be  positive  at  all 
frequencies. 

The  main  contribution  of  this  paper  is  to  extend  the  theoiy  of  averaging  to 
include  the  case  when  <>(r)  does  not  have  a  (generalized)  Fourier  series 
representation,  but  is  only  known  to  be  regulated  and  bounded.  Thus,  4>(r)  need  not 
be  almost  periodic  nor  even  possess  an  average  value.  We  also  state  stability 
conditions  in  the  frequency-domain  in  a  form  similar  to  (1.2).  Moreover,  H(s)  need 
not  be  rational.  Analogous  results  can  be  stated  for  discrete-time  systems,  see,  e.g., 
[13]. 

Averaging:  Uses  and  Limitations 

The  averaging  theory  developed  here,  as  well  as  averaging  theory  in  general,  has 
its  uses  and  limitations  for  adaptive  system.  In  the  first  place,  the  theory  requires  slow 
adaptation  which  can  be  counter-productive  because  performance  can  be  below  par  for 
the  long  period  of  time  it  takes  for  the  parameters  to  adjust.  Secondly,  the  averaging 
results  developed  in  the  sequel  concern  linear  time-varying  systems  only,  so  that 
application  of  these  results  to  the  nonlinear  adaptive  system  requires  a  linearization.  In 
this  sense  we  can  obtain  information,  including  frequency  domain  information,  about 
the  dynamical  behavior  of  the  adaptive  system  in  the  neighborhood  of  the  tuned 
system.  Both  stability  and  instability  conditions  are  discussed.  The  results  arising 
from  a  combination  of  small  gain  theory  and  perturbation  methods,  e.g.,  [2,  3,  14,  15], 
are  restricted  to  stability  results,  and  are  far  less  quantitative. 

Organization  of  Paper 

The  paper  is  organized  as  follows:  Section  2  develops  a  method  of  averaging  for 
linear  systems  with  sample  averages.  In  Section  3  we  apply  the  general  results  of 
Section  2  to  (1.1)  and  obtain  conditions  for  stability  and  instability.  In  Section  4  these 
are  interpreted  in  terms  of  frequency  domain  stability  conditions.  In  Section  5  we 
provide  a  general  discussion. 


Notation 


The  symbol  |  |  denotes  both  the  vector  norm  as  well  as  its  induced  matrix  norm. 
Similarly,  ||  }|p,  p  e  [1,H>  denotes  the  Lp-norm  of  a  vector  or  matrix  function,  i.e.,  for 

pe  (l.oo),  |lf|^  =  (|  |  F(t)  fdt)x'P,  and  ||F|L  =  ess  sup  {|  F(t)  \:  t  >  0}.  X,(4) 

denotes  the  Ith  eigenvalue  of  matrix  4  and  0,(4)  denotes  the  i**  singular  value  of 
A,  i.e.,  0,(4)  =  [A.,{4*4)]1/2.  An  operator  H  is  Lp-stable  if  3  constants  k,  b  such 
that  \\Hu\]p  <  &||u||p+b,  v  u  e  Lp.  The  smallest  k  is  referred  to  as  the  Lp-gain,  and 
is  denoted  by  yp(H). 

2.  METHOD  OF  AVERAGING  FOR  LINEAR  HOMOGENEOUS  SYSTEMS 

In  this  section  we  will  consider  the  homogeneous  linear  time-varying  system 

x  =  e4(r)x  (2.1) 


Lemma  2.1: 

Suppose  in  (2.1)  that  e  is  a  real  constant  and  4():  IR+  — >  IRnxn  is  regulated  and 
bounded.  Then  V  s,  x  €  IR+,  the  transition  matrix  F(s- K,  s)  of  (2.1)  is  given  by 

F(s+x,  s)  =  exp[et4T(r)]  +  R(s,  ex)  (2.2) 

where 


A,(r)  =  i  J  A{t)dt  (2.3) 

S 

is  referred  to  as  the  sample  average  value  of  4(r)  on  the  interval  s  <  t  <  s+x,  and 
11*0,  ex)|L  <  (ET||4|L)2exp(Et|i4|L)  :=  r(ex||4|L)  (2.4) 


Proof. 

Using  the  Peano-Baker  series  representation  for  the  transition  matrix  of  (2.1) 
gives: 

!  < 

S+1  —  J+T  *1  **-) 

F(s+x,  s)  =  /+£  J  Am+'E  E*  j  4(fj)  j  4(/2)  ...  j  A{t^dt\  ...  dtk 

i  k=2  s  s  f 


m»ji  "j aw  n  'j.  u  »jj  ijj  ■  j  w  *m*»  djw  >j.  »ji  gw  *btw» 
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Using  definitions  (2.2)-(2.3)  for  R(s,  ex)  and  At(j),  respectively,  together  with  the 
series  expansion  for  exp(exAt($))  results  in, 

R(s,  ex)  =  £[-(et4t(s))*  /*! 

*=2 

<i  r*-i 

+  £*  J  A(r!)J/4(r2) ...  J  ...  dt$ 

J  J  5 


<  2£(ex||/l|L)*/*!,  v  e  IR+ 

k=2 


=  (ex|HIL)2exp(exJJA|L) 

since  IIAjOIL  <  l|A(-)|| _ This  proves  (2.4). 


Remarks: 

1.  Assuming  that  A(t)  is  regulated  and  bounded  is  sufficient  for  the  existence 
and  uniqueness  of  solutions  [17]. 

2.  Observe  that  Lemma  2.2  is  valid  V  s,  x  e  1R+  and  V  e  e  1R.  In  the 
sequel  we  use  Lemma  2.2  only  for  the  case  when  e  >  0  and  ex  is  small. 

The  stability  properties  of  (2.1)  can  be  established  by  application  of  Lemma  2.2  as 
stated  in  Theorem  2.1  below.  We  first  require: 

Definition: 

The  function  gi(-):  JR"*"  -»  1R,  defined  by 

*t(M)  =  lim  (|/+oM|  -  l)/a  (2.5) 

aiO 

is  called  the  measure  of  the  matrix  M,  where  |  |  is  an  induced  matrix  norm  on 

For  any  induced  matrix  norm  and  its  corresponding  measure,  the,  following 
properties  hold  (see,  e.g.,  [16]): 


•»  *.  ••  *.  *.>F  vvv* 


(PI)  -|  M  |  <  -p(-M)  <  ReX(Af)  <  p(Af)  <  |  A/  |,  v  M  e  R"*" 
(P2)  H(A/!+A/2)  <  H(A/,)  +  h(A/2),  V  A/lfA/2  e 
(P3)  The  transition  matrix  F(t,x)  of  x  -  M{t)x  satisfies, 

t  i 

exp(-  J  p[-A/(.r)]<is)  <  |F(r,t)|  <  exp(J  p[A/(r)]<fc) 

X  X 

(P4)  If  the  vector  norm  on  IRn  is  lx|  =  (x'Px)112,  P  =  P'  >  0,  then 

\M\  =  max  oi(PmMP~m) 
i 

p(Af)  =  {max  kt{PmMrm+p-1/2M'Pm) 


(2.6a) 

(2.6b) 


(2.6c) 


(2.6d) 


(26e) 


These  properties,  together  with  Lemma  2.2,  yield  the  following  stability  result  for 
system  (2.1). 

Theorem  2.1: 

Suppose  j4(?)  in  (2.1)  is  regulated  and  bounded  with  the  sequence  of  sample 
averages  {Aj{kT),  v  A  e  Z+}.  Then: 

(i)  If  3  T>0  and  a>0  such  that 

ViAjikT)]  <  -a  ,  (2.7) 

then  3  T]  >  0  such  that  V  eT  e  (0,T|)  the  zero  solution  of  (2.1)  is  u.a.s. 

(ii )  If  3  T  >  0  and  a  >  0  such  that 

<  -a  ,  UeZ+  (2.8) 

then  3  T)  >  0  such  that  v  eT  e  (0,*n)  the  zero  solution  of  (2.1)  is 
completely  unstable. 


Proof. 


Combining  (2.6c)  with  (2.2)  gives, 

|exp[-etn(-A,(i)]|  <  |F(s+r,  j)-fl(s,et)|  =  |exp[ex4T(i)]| 
<  exp{etn(AT(r))]  ,  v  s,x  e  1R* 
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which  implies. 


(2.9) 

(2.10) 


v 


IfXs-K,  s)|  <  expletn(i4x(s))]  +  r(EXm) 
jF(s+x,  5)|  >  exp[-€tnMx(5))]  -  r(etm) 


where  we  have  used  (2.4)  with  ||4|L  -  m. 

We  first  prove  part  (i)  by  using  condition  (2.7)  and  inequality  (2.9)  with  t  =  7 
and  5  as  kT.  This  gives, 

\F((k+l)T,  kT)\  <  exp(-eTa)  +  r(eTm)  ,  v  ik  €  Z+ 

We  now  establish  that  for  all  small  e7  >  0,  (7((/:+l)7,  kT)\  <  1,  i.e.,  the  map 
9 {kT)  — *  0((Jk+l)7)  is  a  contraction.  From  the  definition  of  K  )  in  (2.4),  it  follows 
that  for  any  a  >  0  there  is  a  q  >  0  such  that 

exp(-i}a)  +  r(r}m)  =  1  (2.11) 

Hence,  for  all  e7  e  (0,rj),  there  is  a  P  >  0  such  that 

exp(-eTa)  +  r(eTm )  =  exp(-eTp)  <  1  (2.12) 

which  shows  the  contraction  property. 

Now,  for  any  t,s  e  IR^  with  t  >  s,  there  exists  an  integer  k  >  0  such  that 
s+kT  <  t  <  s+(k+l)T.  Thus, 


1F(^)|  =  \F(tj+kT)F(s+kT ,  s+(k-l)T)  ...  F(s+T,  s)\ 

<  \F(.tj+kT)\cxp(-eJcTP)  , 

<  |F(r,s+Jk7)|exp(-e(f-5-T)P)  ,  by  kT  >  t-s-T 

<  exp(er(m+P)exp(-e(r-5)P) 

The  last  line  follows  from  Property  (2.6c),  i.e., 

i 

\F(tj+kT)\  <  exp(  j  p[e4(T)]dT) 

*+*T 

<  exp(em(t-s-kT))  ,  by  n[4(x)]  <  |4(t)|  <  m 

<  exp(emT)  ,  by  t-s-kT  e  (0,7) 


rm  K  .  .j 

*•  *  ■ >  * 

- 

3 


s 


This  proves  part  (i)  of  Theorem  2.1.  The  proof  of  part  (ii)  follows  from  the  above 
analysis,  but  starting  with  inequality  (2.10). 

□ 

Using  the  same  technique,  but  allowing  A( •)  (equivalently  AT (•))  to  possess  a 
uniform  average,  we  obtain  the  following  sharper  result. 

Theorem  2.2: 

Suppose  A(t)  in  (2.1)  is  regulated,  bounded,  and  has  a  uniform  average 
A  e  TR™,  i.e., 

lim  AAs)  =  A  (2.13) 

T  - 

uniformly  V  ieR.  Under  these  conditions: 

(i )  If  3  a  >  0  such  that 

ReX(Aj  <  -a  (2.14) 

then  3  Eq>0  such  that  V  ee  (0,£o)  the  zero  solution  of  (2.1)  is  u.a.s. 

(ii )  If  3  a  >  0  such  that  ReX(A)  *  0  and 

maxReX(A)  >  a  (2.15) 

then  3  Eq  >  0  such  that  v  e  e  (0,e0),  the  zero  solution  of  (2.1)  is 
unstable. 

Proof. 

We  first  prove  part  (i).  Assumption  (2.13)  means  that  *  §  >  0, 

3  7X8)  >  0  such  that 

\Ajis)  -  A\  <  8  ,  V  1R+  (2.16) 

From  (2.9).  with  ||A(*)IL  =  m,  we  have 

|F(j  +  T,  s)|  <  exp[e7)x(A  +  Aj{s)  -  A]  +  (e7m)2exp(e7m) 

<  exp[eT(|i(A)  +  8)]  +  (e.Tm)2exp(eTm)  _  ( 

Since  ReX(A)  <  0,  there  is  a  constant  matrix  P  -P'  >  0  which  satisfies  the 


Lyapunov  equation, 


A'P  +  PA  +  21  -  0  (2.18) 

Now,  choose  as  a  norm  on  IR", 

W  =  {x?Px)m  (2.19) 

From  (2.6d)  and  using  (2.18)  we  then  have, 

p(A)  =  -  |ma x\i{rl'2(A'p  +  PA)P~1'2} 

2  j 

=  -min  =  -a  (2.20) 

I 

Hence,  (2.17)  becomes, 

|F(j  +  t,j)!  <  exp[-  eT(ol  -  8)]  +  (eTMfcxpieTM)  (2.21) 

By  assumption  (2.13)  it  is  always  possible  to  select  T(8)  in  (2.16)  such  that  6  <  a. 
By  inspection  of  (2.21),  there  then  exists  e©  >  0  such  that  V  e  e  (0,  e©), 
|F(j  +  T,  j)|  <  1,  V  s  e  JR*,  which  completes  the  proof  of  part  (i).  Part  (ii)  can  be 
proven  in  an  analogous  manner  starting  with  (2.10)  and  using  (2.18)  with  A  replaced 
by  -A.  Note  that  here  Re  MA)  *  0  is  required  explicitly. 

□ 

Discussion 

The  results  in  Theorem  2.9  and  Theorem  2.2  generalize  some  results  obtained  by 
averaging  methods  such  as  those  described  by  Hale  [10],  or  as  obtained  by  Coppel 
[18]  using  the  notion  of  integral  smallness.  Theorem  2.2  is  a  classical  result  of 
averaging  theory,  except  that  as  stated  it  allows  for  functions  which  are  not  necessarily 
almost  periodic.  The  class  of  functions  allowed  in  Theorem  2.2  —  regulated,  bounded, 
with  a  uniform  average  -  is  not  precisely  characterized.  Obviously  it  includes  the 
class  of  asymptotically  almost  periodic  functions  of  the  form 

m  =  A0(t)  +  Ax(f)  (2.22) 

where  A0(t)  is  almost  periodic  and  MjO)  €  Lf*",  p  e  [1,«>]. 

Theorem  2.1  considers  a  larger  class  of  functions  —  those  without  an  average  —  at 
the  expense  of  a  weaker  result:  the  stability -instability  boundary  is  not  as  sharp  as  in 


Theorem  2.2. 

An  example  of  a  function  which  satisfies  the  conditions  of  Theorem  2.1,  but  not 
of  Theorem  2.2  is: 

A(r)  =  A0  +  (1/  V2)j4i(sin  log  r+cos  log  t)  (2.23) 

where  A,-  =  A/  >  0,  i  -  0,  1  such  that  AQ-Ai  >  0.  This  function  does  not  have  a 
uniform  average,  as  can  be  seen  from 

j  J  A(t)dt  =  A0  +  sin  log (s+T)  -  j:  sin  log  sj  (2.24) 


However,  it  satisfies  the  conditions  of  Theorem  2.1  because  from  (2.23) 

j+r 

j  J  A(r)dr  >  Ao  —  A,  >  0  >  vr>0. 


(2.25) 


Condition  (2.7),  which  is  the  basis  for  the  u.a.s.  property,  has  some  interesting 
interpretations.  In  the  first  place,  since  a>0  is  a  constant,  conditions  (2.10) 
provides  a  uniform  bound  on  the  sequence  of  sample-average  measures 
{p[Aj<AT)],  k  €  Z+}.  From  the  definition  (2.5),  the  measure  is  dependent  on  the 
underlying  vector  norm.  Suppose  we  choose  as  the  vector  norm  (r|  =  (. x'Px)m  with 
P  -P'  >  0  a  constant  matrix.  This  was  done  in  the  proof  of  Theorem  2.2  where  P 
was  given  as  the  solution  to  (2.18).  In  general,  however,  we  have  from  (2.6e)  that 

mjikT)]  =  |  max  \{P-Xl2  [AT\kT)P  +  PAj{kT)]P-m}  (2.26) 


If  there  is  a  constant  matrix  P  =  P'  >  0  such  that 

4  max  \i{ATXkT)P  +  PAjikT)}  <  -1 


V  Z+ 


then  (2.7)  holds  with  the  choice 


a  =  min  ^{F-1) 


(2.27) 


(2.28) 


Observe  that  (2.27)  is  not  equivalent  to 

Re  \{Aj(kT)}  <0  ,  V  k  €  Z+  (2.29) 

This  latter  condition  means  there  is  a  sequence  of  matrices  {P(k)  -  P(k)'  >  0,  It  €  Z+} 
which  satisfy 


y 


(2.30) 


max  MV(*)P(*)  +  P^AjikT))  =  -2  ,  UeZt 

I 

Unfortunately,  it  makes  no  sense  to  choose  a  time-varying  norm,  e.g.,  W  =  (x'F(<:)x)1/2. 
Hence,  condition  (2.27)  provides  a  means  to  satisfy  (2.7),  provided  that  constant  P 
can  be  found. 

A  simple  sufficient  condition  for  (2.20)  is  that 

max  \i{A(k)  +  Aik)')  <  -  20k,  .  v  *  «  Z+  (2.31) 

i 

where  Og  is  a  positive  constant.  Hence,  we  can  take  P  =  (l/o t©)/  in  (2.27)  and  thus 
(2.10)  holds  with  a  =  Oq.  We  will  discuss  condition  (2.31)  further  when  we 
specialize  Theorem  2.1  for  adaptive  systems  in  Section  4. 

Theorem  2.1  also  requires  that  eT>0  be  sufficiently  small,  i.e.,  that 
zT  e  (0/n).  From  the  proof  of  Theorem  2.1  we  can  extract  a  value  for  q  and  also 
state  bounds  on  the  exponential  rates  of  growth  or  decay  of  the  transition  matrix 
F{t,\)  for  all  t  >  t.  Specifically,  we  have: 

Corollary  2.1: 

If  A(t)  is  regulated  and  bounded  with  ||A(-)|L  <  m,  then: 

(i )  Whenever  (2.10)  holds  for  some  T>  0,  the  zero  solution  of  (2.1)  is  u.a.s. 
v  er  e  (0,ti),  i.e., 

|F(//c)|  <  M  exp(  -  e(r  - 1){3)  (2.32) 

where  T|,  M,  and  P  satisfy: 

exp(  -  Tja)  +  r(T)m)  =  1 
M  =  exp(eT(m  +  JJ)  >  1 

exp(-e7T5)  =  exp(-ETa)  +  HzTm)  <  1  (2.33) 

(ii)  Whenever  (2.11)  holds  for  some  T>  0,  the  zero  solution  of  (2.1)  is 
unstable  v  er  e  (0,T)),  i.e., 

|F(r,T)|  >  M  exp (e(r  -  t)P)  (2.34) 

where  T|,  M,  and  P  satisfy  >  < 

exp(qo)  -  r{nm)  -  1 


M  =  exp(  -  eT(m  +  (5)  <  1 
cxp(eTp)  =  cxp(eTa)  -  r(cTm)  <  1 


(2.35) 


□ 

3.  STABILITY  OF  LINEARIZED  ADAPTIVE  SYSTEM 

In  this  section  we  apply  the  results  of  Section  2  to  the  linearized  adaptive  system 
(1.1)  under  slow  adaptation,  Le.,  small  e  >  0.  The  first  step  is  to  transform  (1.1)  into 
a  form  suitable  for  application  of  Theorem  2.1.  This  is  accomplished  by  a  time-scale 
decomposition.  That  is,  under  slow  adaptation  the  parameters  6(r)  change  much 
more  slowly  than  the  internal  states  of  the  dynamical  system  H.  This  suggests 
approximating  (1.1)  by  the  system 

6  =  £[/■-  (WOQ]  (3.1) 

for  which  Theorem  2.1  would  apply,  i.e.,  replace  A{t)  in  (2.1)  with  -e(<f>A/<t>')(0-  We 

start  with  the  following  intermediate  result,  developed  in  [20]  and  based  on  the 
discrete-time  formulation  in  [21]. 

Lemma  3.1: 

System  (1.1)  is  equivalent  to 

0  =  etf-rte  +  eWtfe)]  (3.2) 

where  R  is  the  time  varying  matrix 

R(i)  =  (4>Hf)(0  (3.3) 

and  Wlffi)  is  the  linear  integral  operator 

W.B)  =  WJf-  mW))  (3.3) 

with  Gq  the  linear  integral  operator  whose  kernel  is, 

T 

S*(f,T)  =  f  h(t  -  sW(s)ds  ,  0  <  X  <  t 


(3.5) 


which  is  also  a  Lyapunov  transformation,  i.e.,  both  original  and  transformed  systems 
have  identical  (Lyapunov)  stability  properties,  see  [24,  p.  117].  In  this  case  system 
(3.1)  is  equivalently  represented  in  state  form  as 


iS 

0  =  tm-WVz)  ,  0(0)  =  0O 

(3.7a) 

*  • 

i  =  Az  +  ixf>'(r)0  ,  z(0)  =  0 

(3.7b) 

Using  the  “L-transformation” 

£ 

4  =  z-L(/)0 

(3.8a) 

& 

where  L(r)  satisfies 

ss 

,y 

L  =  AL  +  bV(t) 

(3.8b) 

V'- 

gives 

§ 

e  =  e[/W  -  <t>(OcU(')0  +  4)] 

(3.9a) 

4  =  A%  -  eL(t)m  -  <Kr)cXL(r)0  +  £)] 

(3.9b) 
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Since  0(0)  =  0O  and  z(0)  =  0  by  definition  (1.1),  it  follows 

that  by  assigning 

1 

L(0)  =  0  we  have* 

R(t)  *  4>(r)cX(0 

(3.10) 

§ 

and  hence,  (3.9a)  becomes 

0  =  e[/(f)  -  R(t)Q  -  <Kr)c'4] 

(3.11) 

Since  £(0)  =  0  from  (3.8)  and  Re  X(A)  <  0  because  H(s)  is  stable,  it  follows  that 
^(f)  =  0(e).  Thus,  (3.11)  is  dominated  for  small  e  by  Q  =  e(f  -  RQ).  Consequently, 
both  the  “L-transformation”  and  the  operator  decomposition  (3.2)  arc  qualitatively 
equivalent  for  small  e. 

Using  Lemma  3.1,  we  now  state  conditions  for  exponential  stability  of  system 
(1.1),  i.e.,  the  map  (0O,/)  -»  0. 


Note  that  the  choice  of  initial  condition  for  L(0)  is  immaterial  when  discussing  asymptotic  stability  properties,  i.e., 
since  A  is  suble,  different  initial  conditions  give  rite  to  different  exponentially  fast  decaying  transients. 


Theorem  3.1: 

Assume  that: 

(A  1)  0  =  -eK(r)0,  R(i)  -  (<J>/f<j>')(0.  is  u.a.s.  with  transition  matrix  F(t,x) 
overbounded  by 

|F(/,t)|  <  WrtP(M)  ,  V  f>T>0  (3.12) 

(A  2)  The  impulse  response  h(t)  of  H  satisfies 

|A(/)|  ,  V  /  >  0  .  (3.13) 

Under  these  conditions,  3  £q  >  0  such  that  V  £  e  (0,£o),  system  (1.1)  is 
exponentially  stable.  Specifically,  if 

Cq  =  min{a/  P,  ej  ,  p^)  =  0  (3.14) 

then 

t 

10(r)|  <  M\  0O  k_Ep(e)'  +  /  ero(e)e-^(£)(Mtyi:T)|dT  (3.15) 

o 

where 

m(e)  =  M[  1  +  e||<MliK/(a  -  ep)2)  (3.16) 

p(e)  =  P  -  eM||4>|tiK2/(a  -  ep)2  (3.17) 

Proof. 

Using  the  decomposition  from  Lemma  3.1  gives  the  following  expression  for 
(11): 

0(0  =  F(r,O)0o  +  e(WV)(0  -  t2(W2Q)(t)  (3.18) 

where  Wj,  W2  arc  linear  integral  operators  given  by 

Wx  =  F{I  +  e<J>G*)  (3.19a) 

W2  =  F^m'  (3.19b) 

and  where  F  has  kernel  F(t,x).  We  first  show  that  Wx  and  W2  are  exp.  stable 
integral  operators.  !  * 


For  any  integral  operator  W  with  kernel  w(r,t)  we  have 

i 

t 

0 

Therefore,  W  is  exp.  stable  iff  3  o  >  0  such  that 

sup  |c°(,-Jt)w(/jX)|  <  oo 
r  >  t 

Let  the  superscript  notation  (  )°  denote  exponential  weighting,  i.e.,  (x°)(r)  =  e°‘x(t). 
Hence,  (tVu)°  =  W°u°  where  W°  is  the  linear  integral  operator  with  kernel 
Now,  following  pg.  119  of  [16],  let  ||l%  be  defined  by 

111%  =  sup  |w(r,t)|  (3.20) 

i  >  t 

Hence,  W  is  exp.  stable  iff  3  c>  0  such  that 

\m\b  <  ~ 

Observe  also  that  if  Gj  and  G2  are  linear  integral  operators  then 

IIG1G2II*  <  IIGillbYi(C2)  (3.21) 

Applying  these  relations  to  (3.19)  for  some  O  >  0  gives, 

iiw?iu  <  wrm + eii^ily^^)] 

HWflU  <  mW!<WllYi(GJ)Yi(«°) 

Choose  o  =  e(J  with  e€  (0,Eq),  Co  given  by  (3.14).  Using  (3.1 9)-(3.21)  gives 

n  <  M 

Yi(^°)  <  AT/(a-eP) 

Yi(Cf)  <  ML  */(a  -  e3)2 


Hence, 


I mit  <  A/[l+e||<|)||2  K/(a  -  ep)2]  =  m(e) 
l|W§IU  <  M||4>|ti  K2l(a  -  ep)3  =  [P  -  p(e)]/e 


£ 


where  m(e)  and  p(e)  are  defined  in  (3.16),  (3.17).  Going  back  to 
have. 


16(01  <  Me-*'  |  80  |  +  |  e-£P^){em(e)l/(x)| 


+  e[p-p(e)]|e(t)i}dx 

The  result  (3.15)-(3.17)  follows  by  directly  applying  the  Bellman-Grori 
(3.22). 


Discussion 

Under  slow  adaptation,  Theorem  3.1  shows  that  (1.1)  is  expone 
0  =  -e/?(r)0  is  u.a.s.  Hence,  we  can  apply  Theorem  2.1,  with  A( 
—cR(t),  and  arrive  at  stability  condition  (2.7),  that  is: 

System  (1.1)  is  exponentially  stable  for  all  small  e  >  0  if  3  T  >  I 


\l[  -  R(k)]  <0  ,  V  it  €  Z+ 


Dd\  .l.  ith 


wnere  uq  is  a  posiuve  constant.  \_omp 
which  means  the  interval  contraction  of 
Q\kT+T)Q(kT+T)  <  B'(kT)B(kT).  The  si 
Section  4  to  follow. 


tw 
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4.  FREQUENCY-DOMAIN  STABILITY  CONDITIONS 

In  this  section  we  reformulate  condition  (3.12)  in  the  frequency  domain.  This 
involves  the  Fourier  transform  //( job)  and  an  appropriately  defined  expression  for  the 
spectrum  of  <|>(r).  We  show  that  (3.12)  requires  that  <>(/)  have  a  persistent  excitation 
property,  and  that  the  dominant  excitation  be  at  those  frequencies  for  which 
Re  H(  job)  >  0. 

The  first  requirement  is  that  <}>(*)  be  restricted  to  those  functions  which  have  a 
Fourier  series  representation  on  any  finite  interval.  A  known  class  of  such  functions  is 
defined  as  follows  (see,  e.g.,  [19]). 

Definition: 

A  function  /(•):  1R+  ->  1R"  is  a  Cg  function  if  it  is  regulated,  bounded  and 
3  a  constant  5  >  0  such  that  any  two  points  tx,  r2  e  IR+  where  /(•)  is 
discontinuous  are  separated  by  at  least  an  interval  8,  i.e.,  |rj-r2|  >  6. 

Frequency-domain  stability  conditions  for  the  stability  of  (3.1)  can  now  be  stated. 

Theorem  4.1: 

Assume  in  (1.1)  that: 

(A  1)  \h(t)\  <  Ke™  ,  V 

t  >  0.  (4.1) 

(A  2)  6  with  piece-wise  Fourier  scries  representation  v  k  e  Z+: 

m  -  X  O k&W'*  .  [kT,(k+\)T)  ,  T  >  5  (4.2) 

CDE  n* 

where  Clk  is  the  set  of  distinct  Fourier  exponents  and  a*(  )  the  corresponding 
Fourier  coefficients.  Let  B(k)  e  TRpyp  be  defined  by^ 

B(k)  =  2  a*(a))ai(£D),//(-;£0)  ,  v  k  e  Z+  (4.3) 

to  €  n4 

^  Overtor  p")  denotei  complex  conjugation. 

!  < 
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Under  these  conditions: 

(i )  If  3  T  >  8  such  that 

m-£<*)]  <  -2H4>||!  (K/a?)/T  ,  V  k  e  Z+  (4.4) 

then  B  Eq>0  such  that  V  ee  (0,e0),  system  (1.1)  is  exponentially 
stable. 

(ii )  If  3  T  >  8  such  that 

p[fi(*)]  <  -2U4.nl  (K/a2)/T  ,  V  k  €  Z+  (4.5) 

then  3  Cq  >  0  such  that  V  e  £  (0,£o).  system  (1.1)  is  unstable. 


Remarks: 

1.  The  representation  (4.2)  for  4>(0  specifies  the  local  frequency  content  over 
t  €  [kT,  (*+1)7].  Such  a  representation  -  if  not  given  —  can  always  be  found  if 
4»(r)  £  [17];  then  (4.2)  can  be  obtained  via  the  Fourier  series  of  the  7-periodic 

function: 

4 >*(0  =  <j)(r  4-  mT)  re  ((4  -  m)T,  ( k-m+  1)71 
V  ieN  ,  VW£Z 

Notice  4 >*(r)  is  well-defined  on  ]R  and  has  a  Fourier  series  representation: 

4 >*«  -  £  ;  v  ,€  1R 

me  Z 


where 


and  hence 


a*  =  a!*  ,  com  =  2nm/T 


Ut)  -  £  at(/com)e'ou  ;  V  r  €  [*7\(*+l)71  (4.6) 

me  Z 

which  is  of  the  form  (4.2). 

2.  The  matrix  B(k)  can  be  equivalently  expressed  as  the  sample  average  value 
of  the  T'-periodic  part  of  (4>*//4**,)(0>  i.e., 

(*+1)7-  1  ( 

B(k)  =  1  J  4>*(r)V*(r)'<*  (4.7) 


v  y,  '  ■ 


C> 


ft 


Vv V 

O  v  *•  v 


where  y*(r)  is  the  T-periodic  part  of  //($*)(*)»  i.e., 

i 

V*(0  =  /  h(t-x)$k(x)dx 

=  £  Hij^a^e**  , 

neQt 


V  f€  R 


Proof. 

To  prove  part  (i),  it  follows  from  Lemma  3.1  and  Theorem  3.1  that  it  is  only 
necessary  to  show  that  (4.4)  implies 

<  0  ,  V  *  e  Z+ 

where  R(k)  is  given  by  (3.24).  We  start  by  defining 

B(k)  =  R(k)  -  B(k) 
with  B(k)  from  (4.3).  Using  (4.8)  gives 

<*+l)T 

B(k)  =  1  J  MOVk'W 


where 


=  <//<>)(/) -y*(f) 


=  |  h(t  -  t)<j>(x)dt  -  J  h{t  -  x)$k(x)dx 

KT  KT 

=  |  h(t  -  X)$(X)dX  ~  j  h(t  -  TX>*(T)rft 


The  last  line  follows  from  (4.6),  i.e.,  6*(0  =  4>(0  for  r  e  [kT,(k+\)T).  Using  (4.1) 


gives. 


IV*(0I  <  (2H6L  K/a)e~ 


from  which  it  follows  that 


W)\  <  2WE,  (K/a^yr  ,  v  *  e  z+ 

i  ( 

This  together  with  inequality  (2.6a)  proves  part  (i).  Part  (ii)  follows  analogously  by 


V'/VW/W  ,sv  o> ,V o 


replacing  R(k)  with  -R(k). 

□ 

If  MO  is  further  restricted  so  that  it  has  a  uniform  average,  then  we  can  sharpen 
the  stability-instability  boundary.  For  example,  if  MO  is  almost  periodic  then  a 
Fourier  series  representation  exists  V  t  e  IR+,  and  thus,  it  has  an  average  [10].  The 
stability  conditions  for  this  case  are  stated  as  follows. 

Theorem  4.2: 

Suppose  in  (1.1)  that  MO  is  almost  periodic  with  generalized  Fourier  series 

<i>(0  -  £  a(a>)e*“  ,  v  t  e  1R+  (4.9) 

CD  €  Q 

where  £2  e  1R  are  the  distinct  Fourier  exponents  and  {a(co),  to  e  Q)  are  the  Fourier 
coefficients.  Define  the  matrix  B  by 

B  =  £  a(co)a(to)'//(-yco)  (4.10) 

a>£  <2 

If  Re  X(B)  *  0  then  3  Eq  >  0  such  that  V  e  e  (0,Eo),  system  (1.1)  is: 

(i)  exponentially  stable  if  Re  X(B)  <0  (4.1 1) 

(ii)  unstable  if  max  Re  X,(B)  >  0  (4.12) 

i 

Remark:  The  proof  of  Theorem  4.2  is  entirely  analogous  to  that  of  Theorem  4.1. 

Theorem  4.2  is  the  result  obtained  in  [9]  when  MO  is  almost  periodic.  Theorem  4.1 

is  a  generalization  to  M  )  €  C%. 

5.  DISCUSSION  OF  RESULTS 

(A)  Effect  of  Transients  on  Sample  Average 

An  informative  inteipretation  of  stability  condition  (4.4)  is  that  the  average  energy 
in  the  T-periodic  part  of  (MJ/<fr/)(0  must  dominate  (or  overcome)  the  possibly 
negative  efforts  of  the  transient  terms.  In  other  words,  the  period  T  must  be 
sufficiently  larger  than  the  dominant  time  constant  of  H,  i.e.,  T  »  1/a.  Note  that  the 
term  2|K>Jii(K7a2)  essentially  arises  from  initial  conditions  or  stored -energ^  in  H  at 
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t  =  kT.  Obviously  when  $(0  has  a  uniform  average  it  is  always  possible  to  select  T 
to  be  sufficiently  large,  e.g.,  as  shown  in  the  proof  of  Theorem  2.2. 

Using  (2.27)  condition  (4.4)  holds  if  there  is  a  constant  matrix  P  -  P  >  0  such 
that  V  ke  Z+, 

min  \,{Q(k) J  >  1  ,  V  *  e  Z+  (5.1a) 

i 

where 

G(*)  =  {  X  Hirj^lPX^Xt^P]  (5.1b) 

OE  fit 

x^co)  =  a*(a»St((D)'  =  [Xkmr  (5.1c) 

(B)  Relation  to  Persistent  Excitation 

A  necessary  condition  for  the  existence  of  P  which  satisfies  (5.1)  is  that  for 
some  finite  integer  q  >  (p- 1)12  and  V  k  e  Z+, 

rank[a*(0)f  <**(©, ),...,  a^),  S^co,),...,  5*(co?)]  =  p  (5.2) 

If  this  were  not  the  case  then  min  QtfP)]  -0,  V  Jfc  e  Z+  and  V  P  -  P'  >  0. 

f 

Hence,  Theorem  4.1  implicitly  restricts  4K  )  e  Cf  to  those  functions  whose  (time- 
varying)  Fourier  coefficients  satisfy  the  rank  condition  above.  This  class  of  functions, 
however,  are  precisely  those  which  can  be  categorized  as  persistently  exciting  [1]: 


Definition: 

A  function  /(  ):  R*  — >  1R"  is  persistently  exciting  (FE)  over  an  interval  h  if  it 
is  regulated,  bounded,  and  3  constants  h  >  0  and  p  >  0  such  that 


nun  ffltm'dt^  >  P  ,  v  ie  R+  (5.3) 

Denote  such  functions  by  fi)  e  PEP(hfi). 

It  follows  from  the  definition  that  if  $(  )  €  PE?(h$)  n  C%  then  the  rank 
condition  (5.2)  will  hold  for  any  T  >  h  >  8,  and  thus,  (5.1)  may  be  satisfied  for 
some  matrix  P.  The  point  to  emphasize  is  that  persistent  excitation  is  not  sufficient 


•V'V-V'V  < 


for  stability,  except  in  the  case  when  H(s )  is  SPR  [1].  Thus,  we  can  view  (5.1)  as  a 
signal  dependent  positivity  condition.  In  general,  the  PE  condition  is  necessary  for 
stability,  but  as  seen  from  (4.5)  in  Theorem  4.1,  even  if  it  holds,  the  system  can  be 
still  be  unstable. 

(C)  Parameter  Scaling 

The  matrix  P  in  (5.1)  can  be  viewed  as  a  scaling  of  the  parameter  vector.  That 
is,  if  (5.1)  holds  for  some  P,  then  for  all  small  e  >  0,  0  =  -e/?(f)0  is  u.a.s.  in  the 
sense  that  0(r)'P0(r)  — >  0  exponentially  fast  as  t  — »  «>.  Thus,  parameters  will  tend  to 
converge  with  different  scalings.  If  for  some  given  signal  $(r),  the  determined 
scaling  matrix  P  gives  unwanted  responses,  then  the  signal  can  be  reshaped  so  as  to 
produce  a  more  desirable  scaling.  The  difficulty  is  in  finding  the  matrix  P.  If  4>(r) 
is  almost  periodic  then  Theorem  4.3  holds,  and  we  can  take  P  as  the  solution  to 
PA+A'P  =  -21.  When  <j>(r)  has  a  sample-average,  there  is  no  simple  means  to  find 
P. 

If  there  is  sufficient  a  priori  knowledge  about  the  effect  of  parameters  on  the 
system,  then  this  information  will  provide  the  desired  scaling  in  the  following  sense. 
It  is  always  possible  to  prescale  0  and  then  select  P  =  —  /  where  a©  is  some 

cto 

positive  constant  With  this  choice,  condition  (5.1)  becomes, 

min  X,{  ]£  -  jco)Re[Xt(tD)]}  >  Oq  ,  v  jfc  e  Z+  (5.4) 

*  ©  €  (2* 

This  is  equivalently  expressed  as, 

nun  X,{  £  Re[//0'<a)]Re[X*(co)] }  >  cx</2  ,  v  JteZ+  (5.5) 

*  <0  E  (2a 

which  has  a  more  informative  interpretation  in  terms  of  the  usual  positivity  conditions 
on  H.  For  example,  a  strictly  proper  transfer  function  H(s)  is  strictly  positive  real 
(SPR)  if  it  is  exponentially  stable  and  3  constant  p  >  0  such  that  [16]: 

Re[//(/o>)]  >  pf//(/o>)l2  ,  V  o)  €  1R+ 

This  condition  must  hold  at  every  frequency,  whereas  (5.5)  requires  Re[//(/(o)]  >  0  at 
those  discrete  frequencies  in  R+  where  the  magnitude  of  the  input  spectrum  is  large. 
Conversely,  at  those  frequencies  in  F+  where  Re[//(/ci>)]  <  0,  the  magnitude  of  the 
input  spectrum  should  be  small.  Since  (5.5)  will  fail  if  ReH(jv>)  <0,  V  ©  e  1R+,  it 


follows  that  R e//(/<o)  >  0  at  some  frequencies,  hence,  the  motivation  to  refer  to  (5.1) 
as  a  positivity  condition. 

(D)  Bounds  on  e 

The  upper  bound  Eq  on  the  size  of  e  >  0  to  insure  stability  can  be  extracted 
from  the  proof  of  Theorem  4.1.  Looking  back  over  Theorem  3.1,  Theorem  2.1  and 
subsequent  discussions  we  have, 

eo  =  min  {a/p  ,  tx ,  £2}  (5.6) 

where  Ej  and  e 2  satisfy 

P  -  £,M||<>|iiA:2/(a  -  EjP)2  =  0  (5.7) 

exp(-  e2Ta0)  +  (E2ra0)2exp(e2ra<))  =  1  (5.8) 

Recall  from  the  proof  of  Theorem  3.1  that  Oq,  a,  P,  Af,  and  K  are  defined  from: 

<*+1)7 

/ 

*7 

M  =  exp(e7*(m+p)  (5.10) 

\h(t)\  <  K  exp(-  ou)  (5.11) 

exp(-  e7p)  =  exp(-  eTcxq)  +  (zT a0)2exp(e7' Oq)  (5.12) 

(E)  A  Limitation  Arising  from  Averaging 

Suppose  H(s)  is  SPR  and  (4.4)  holds.  Hence,  system  (1.1)  is  exponentially 
stable  for  all  small  e  >  0.  Since  (4.4)  holds,  it  follows  that  $(r)  is  persistently 
exciting.  However,  from  other  arguments  (see,  e.g.,  [1])  we  know  that  under  these 
same  conditions  the  zero  solution  of  (3.1)  is  u.a.s.  for  all  e  >  0.  Thus,  Theorem  4.1  is 
conservative  in  this  case  in  regard  to  the  limitations  on  £.  However,  when  H(s)  is 
not  SPR  Theorem  4.1  is  now  applicable  whereas  the  results  in  [1]  do  not  apply.  In 
fact  in  this  latter  case  when  £  gets  too  large  then  system  (1.1),  can  be  unstable,  even 
if  (4.4)  holds.  For  example,  if  in  (1.1)  $(r)  =  sin(0.35r)  and  H(s)  =  l/(s2+2s+2) 
then  condition  (4.4)  is  satisfied.  The  simulations  in  Fig  5.1  with  6(0)  =  1  shpw  that 
the  zero  solution  is  u.a.s.  for  E  =  4  but  is  completely  unstable  for  E  =  8. 


Figure  5.1 


Comparison  with  Averaging  Analysis  of  Stochastic  Recursive  Algorithms 

Comparing  our  results  with  the  ordinary  differential  equation  approach  (ODE), 
used  in  the  analysis  of  stochastic  recursive  algorithms,  [22],  [23]  we  notice  the 
following  differences: 

(1)  The  ODE  approach  can  deal  with  nonlinear  recursions,  whilst  our  analysis  is 
restricted  to  the  linear  case.  It  is  possible  to  extend  our  results  to  the  nonlinear  case, 
but  this  would  introduce  more  technicalities  (see  e.g.,  [20])  perhaps  obscuring  the  main 
idea  of  ‘ ‘local  averages.” 

(2)  In  the  ODE  approach  it  is  assumed  that  the  adaptable  gain  (our  e)  converges 
to  zero  (and  is  not  summable),  whilst  in  the  present  contribution  e  is  a  small  positive 
constant 

(3)  The  main  difference  lies  in  the  condition  imposed  on  the  regressor  vector 
sequence.  Typically,  the  ODE  approach  relies  on  an  ergodicity  or  mixing  assumption 
to  infer  the  existence  of  cesaro-mean  along  the  sample  paths  (  e  average).  Our 
conditions  only  involve  finite  sample  path  properties  of  the  regressor  vector  and  related 


quantities;  this  makes  the  concept  of  cesaro-mean  or  global  average  meaningless.  In 
this  sense,  the  ODE  approach  is  closer  to  [9a, b]  where  periodicity  or  almost  periodicity 
are  invoked  to  guarantee  the  existence  of  averages.  In  a  way,  our  conditions  allows 
for  a  second  slow  time  scale,  the  slower  time  scale  on  which  the  nature  of  the 
regressor  vector  is  allowed  to  change. 

(4)  The  present  approach  yields  instability  results  as  well,  a  point  not  touched 
upon  in  the  ODE  approach. 

6.  CONCLUSION 

We  have  presented  a  method  of  averaging  for  linear  time  varying  systems, 
allowing  one  to  deal  with  general  time  motions,  thus  removing  the  classical  restriction 
of  almost  periodicity. 

This  method  can  be  applied  to  the  nonlinear  adaptive  control  problem  after 
linearizing  the  system  in  the  neighborhood  of  the  tuned  solutions.  Both  (local) 
stability  and  instability  have  been  discussed.  The  conditions  obtained  to  guarantee 
local  stability  can  be  expressed  in  frequency  domain  terms. 
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